DOCUMENT RESUME 



ED 061 980 LI 003 650 

AUTHOR Lazorick, Gerald J- 

TITLE Demand Models for Books in Library Circulation 

Systems. Pinal Report. 

3PONS AGENCY National Science Foundation, Washington, D.C. Office 

of Science Information services. 

PUB DATE Jul 70 

NOTE 23 3 p. ; (48 References) 

EDRS PRICE MP— SO- 6 5 HC-J9.8 7 

DESCRIPTORS *Books ; ^Library Automation; ^Library Circulation ; 



Library Materials; ^Mathematical Models; ^Operations 
Re search 

ABSTRACT 

This research is a study of demands for books in 
library circulation systems. Demand data for random samples of books 
were collected and fitted to various standard distributions. The 
numbers of demands for collections of books are shown to be Negative 
Binomial ly distributed, as is shown, this implies that the numbers of 
demands for individual books in the collection are Poisson 
distrubutad ahd that the demand rate varies from book to book 
according to ja Gamma distribution. Using these facts and assuming 
Exponentially di st rubuted loan intervals, a model is developed which 
will predict the availability and unavailability of a book in a 
library. The practicality of using the model is demonstrated. 

(Author) 



O 

ERLC 



U S DEPART WENT OF HEALTH. 

EDUCATION & WELFARE 
OFFICE OF EDUCATION 
THIS DOCUMENT HAS BEEN REPRO- 
DUCED EXACTLY AS RECEIVED FROM 
THE PERSON OR ORGANIZATION GRIG^ 
1NATING IT POINTS OF VIEW OR OPIN 
IONS STATED DO NOT NECESSARILY 
REPRESENT OFFICIAL OFFICE OF EDU - 
CATION POSITION OR POLICY. 



FI?JAL REPORT 



DEMAND .MODELS:- FOR BOOKS 
III LIBRARY GXRGULATIOFT ,f?YSTTJMP 







, ' Gerald J Q : Lasorich , ■ . 

.. -T i: As aociate": Professor 

• School: ; of ylnforsiatxon;'., 

Facn lty of Engineering 













'TABLE OP CONTENTS 



Page No . 



ABSTRACT 




-i 


ACKNOWLEDGEMENTS 


ii i 


LIST OP TABLES 




X 


LIST OP FIGURES 




XX 


1 . 


INTRODUCTION 


1 


1 . 1 


Background 


1 


1 . 2 


Definition of a Demand for a Book 


4 


1 .3 


Distinction between Book Demand, 
Book Use and Book Circulation 


5 


1 . 4 


Definitions of Terms 


6 


1 . 5 


Objectives of the Research 


8 


1.6 


Approach to the Problem 


8 


1 .7 


Conventions Employed in the 
Report 


in 


2 . 


REVIEW OF RELATED RESEARCH 


1 2 


2.1 


The First Book Use Study 


1 3 


2.2 


The Fussier and Simon Data at the 
University of Chicago Library 


13 


2.3 


Deterministic Book Use Models 


1 6 


2.4 


Probabilistic Book Use Models 


'18 


2.5 


The Jam Study at Purdue 
Un iversity 


19 


2.6 


The Morse Book Demand Model 


26 


3. 


THE NEGATIVE BINOMIAL DISTRIBUTION 


32 


3.1 


Derivation of the Negative 
Binomial Distribution 


33 




2 



iv 



3.2 

3.3 

3.4 

3.4.1 

3.4.2 

3.4.3 

4. 

4 . 1 

4.2 

4-3 

5. 



5.1 

5.2 

5.3 

5.4 

5.5 

5.5, 1 



Page No. 



Parameters of the Negative 

Binomial Distribution 41 

Mean and Variance of the Negative 
Binomial Distribution 42 

Estimation of the Parameters of 

the Negative Binomial Distribution 44 



The Method of Moments 45 

The Method of Maximum Likelihood 45 

The Proportion of Zeros Method 48 

STATISTICAL TECHNIQUES 3ED IN 

THE ANALYSIS OF DATA 49 

Estimating Parameters of the 
Negative Binomial Distribution 49 

Using the Chi Squared Test 
of Goodness of Fit 52 

The Variance Test 60 

THE COLLECTION OF BOOK DEMAND 

AND USE DATA 65 

Types of Demand 65 

Methods Used for Collecting 

Demand Data 67 

Problems and Difficulties in 
Collecting Demand Data 68 

Identification of Books to be 
Included in Random Samples 71 

Samples of Book Demand and Uses 75 

The Collection of Home— Use Book 
Demand Data for The SUNY Buffalo 
Collection 75 




v 



3 



Page No 



5.6 

5.6.1 

5.6.2 

6 . 

6 . 1 
6 . 1.1 

6 . 1.2 

6 . 2 
6 , 2.1 

6 . 2.2 

6 . 3 

6.4 

7. 

7, 1 

7.2 

7.3 

7.4 

7.5 



Collection of Book Demand Data 

for Language and Literature 

(P Class) Books 78 

Records of P Class Demand Data 86 

Identifying a P Class Random 

Sample for Analysis 89 

ANALYSIS OF FREQUENCIES OF KNOWN 

AND UNKNOWN DEMAND FOR BOOKS 91 

The Known Demand Sample 91 

The Chi Squared Goodness of Fit 

Test 93 

The Variance Test 93 

Sample of P Class Book Demands 95 

The Chi Squared Goodness of Fit 

Test 97 

The Variance Test 100 

Analysis of the Distributions 
of Known and Unknown Demand 100 

Implications of Negative 

Binomiaily Distributed Demands 102 

DAILY DEMAND FOR P CLASS BOOKS 104 

Randomly Selecting Books to be 
Included in the Samples 1 05 

The Chi Squared Goodness of Fit 

Test 107 

The Variance Test 107 

Analysis of Daily Demands 109 

Significance of the Distribution 
of Daily Demands 113 




vi 



Page No , 



8. BOOK USE IN SEVERAL INSTITUTIONS 116 

8.1 The Fussier and Simon Data 116 

8.2 Fitting the Negative Binomial 

Distribution 119 

8.3 The Variance Test 121 

8.4 Analysis 125 

9 . THE PREDICTION OF DEMAND 

FOR BOOKS 130 

9.1 A Two Stage Book Use Model 130 

9 . 2 Comments Concerning the Two 

Stage Book Use Model 137 



9*3 Use of the Two Stage Book Use 

Model for Loan Interval 
Establishment for a Book with 
a Known Demand Rate 138 

9.4 Use of the Two Stage Book Use 

Model for Loan Interval 
Establishment for a Book with 



a Demand 1 Rate Which is Unknown 140 

9.5 The Concept of Critical A 144 

9.6 Using the Critical^ Concept 145 

9.7 Establishing Confidence Intervals 

for Demand Rates for Books in a 
Collection 148 

9.8 Comments and Constraints 

Concerning Use of the Demand 

Models 153 

10. SUMMARY OF RESULTS, OBSERVATIONS 

AND RECOMMENDATIONS FOR FURTHER 

WORK 156 



vii 



O 

ERIC 



5 



Page No 



nr 

-i 



M 



10.1 Summary of Results and 

Observations 1 56 

10.2 Recommendations for Further 

Work 159 

REFERENCES 160 

RELATED REFERENCES 164 

APPENDICES 

A QUESTIONNAIRE FOR P CLASS BOOK 

DEMANDS 168 

B INSTRUCTIONS TO DATA COLLECTORS 169 

C PUNCHED CARD FORMAT 174 

D TABLE USED TO CALCULATE CHI 



SQUARED FOR FITTING THE NEGATIVE 
BINOMIAL DISTRIBUTION TO THE 
SHELF LIST SAMPLE FROM THE SUNY- 
BUFFALO COLLECTION 175 

E TABLE USED TO CALCULATE CHI 

SQUARED FOR THE VARIANCE TEST ON 
THE SHELF LIST SAMPLE FROM THE 
SUNY-BUFFALO COLLECTION 177 

F TABLE USED TO CALCULATE CHI 

SQUARED FOR FITTING THE NEGATIVE 

BINOMIAL DISTRIBUTION TO THE 

SAMPLE OF P CLASS BOOKS 179 

G TABLE USED TO CALCULATE CHI 

SQUARED FOR THE VARIANCE TEST ON 

THE RANDOM SAMPLE OF P CLASS 

BOOKS 1 8 1 

H TABLES USED TO CALCULATE CHI 

SQUARED FOR FITTING THE POISSON 

DISTRIBUTION TO THE RANDOM 

SAMPLES OF P CLASS DEMANDED BOOKS 183 



I 




viii 

G 



Page No 



I TABLES USED TO CALCULATE 

CIII SQUARED FOR THE VARIANCE 
TEST ON THE RANDOM SAMPLES OF 
P CLASS DEMANDED BOOKS 

J TABLES USED TO CALCULATE CHI 

SQUARED FOR FITTING THE NEGATIVE 
BINOMIAL DISTRIBUTION TO THE 
FUSSLER AND SIMON BOOK USE DATA 

K TABLES USED TO CALCULATE CHI 

SQUARED FOR THE VARIANCE TEST 
FOR THE FUSSLER AND SIMON BOOK 
USE DATA 




ix 

7 



189 



195 



205 



ABSTRACT 



This research is a study of demands for books in 
library circulation systems. Demand data for random 
samples of books were collected and fit to various 
standard distributions. The numbers of demands for 
collections of books are shown to be Negative Binomiaily 
distributed. As is shown, this implies that the numbers 
of demands for individual books in the collection are 
Poisson distributed and that the demand rate varies from 
book to book according to a Gamma distribution. Using 
these facts and assuming Exponentially distributed loan 
intervals, a model is developed which will predict the 
availability ar.id unavailability of a book in a library. 
The practicality of using the model is demonstrated. 



ii 



8 






ACKNOWLEDGEMENT & 

The author expresses his thanks to ^r. Bernard I. 
Bayer for his helpful editing comments , and the staff 
of the Technical Information Dissemination Bureau for 
helping to prepare this document* 



iii 




3 



ACKNGULE DGEMENTS 

The author expresses his thanks to Professors 
VJarren Thomas (Major Professor) , Richard Schmidt, 
Uayland P„ Smith and Vincent E. Giuliano for their 
suggestions, Mr, Bernard I. Bayer for his helpful 
editing comments, and the staff of the Technical 
Information Dissemination Bureau for helping to 
prepare this document. Thanks is also expressed to 
the author's wife, Sylvia, for the many ways in which 
she helped during the course of the research. 

Partial support for this research was 
received from the National Science Foundation 
(Grant Number GN-706). 



iii 




10 



LIST OF TABLES 



Table Page No . 



4.1 TABULATION OF VALUES USED TO 

ILLUSTRATE THE CALCULATION OF r 53 

4.2 CHI SQUARED TABLE USED FOR 

ILLUSTRATING CHI SQUARED TEST 
CALCULATIONS 55 

4 . 3 TABLE USED TO ILLUSTRATE POWER 

OF CHI SQUARED GOODNESS OF FIT 

TESTS 58 

4.4 VARIANCE TEST TABLE 63 

5.1 A PORTION OF THE RANDOMLY GENERATED 

SHELF LIST DEPTHS FOR RANDOM TITLE 
IDENTIFICATION 73 



5 . 2 NUMBER OF BOOKS RANDOMLY SELECTED 

FOR STUDY IN EACH OF THE SUNY- 
BUFFALO LIBRARIES 77 



5.3 



5.4 

5.5 

5.6 



HOME USE DEMANDS OF TEN RANDOM 
SAMPLES OF BOOKS DURING THE PERIOD 



9/1/66 - 8/31/67 79 

TABULATION OF DATA FROM MARCH 3 , 

1968 THRU MAY 11, 1968 84 

QUESTIONNAIRE SUMMARY OF P CLASS 

DATA 8 7 

SUMMARY OF RANDOM SAMPLE OF P 

CLASS BOOK DEMANDS 90 



6 . 1 FREQUENCY DISTRIBUTION OF KNOWN 

DEMANDS FOR A SAMPLE OF BOOKS FROM 
THE SUNY-BUFFALO COLLECTION DURING 
THE PERIOD SEPTEMBER 1, 1966 - 

AUGUST 31 , 19-67 92 




X 



li 



Table 



Page No, 



6 . 2 



6.3 



6 . 4 



6 . 5 



6.6 



7.1 



7.2 



7.3 



CHI SQUARED GOODNESS OF FIT 
TABLE FOR A NEGATIVE BINOMIAL 
FIT TO THE FREQUENCIES OF KNOWN 
DEMAND FOR A SAMPLE OF BOOKS 
FROM THE SUNY-* BUFFALO LIBRARY 
COLLECTION DURING THE PERIOD 
SEPTEMBER 1, 1966 - AUGUST 31, 

1967 94 

VARIANCE TEST TABLE FOR KNOWN 
DEMAND SAMPLE OF BOOKS DEMANDED 
FROM SEPTEMBER 1, 1966 - 

AUGUST 31, 1967 9 6 



FREQUENCY DISTRIBUTION OF DEMANDS 

FOR P CLASS BOOKS DURING THE 

PERIOD MARCH 3, 1968 - MAY 11, 1968 98 

CHI SQUARED GOODNESS OF FIT TABLE 
FOR A NEGATIVE BINOMIAL FIT TO 
THE FREQUENCIES OF DEMAND FOR THE 
SAMPLE OF P CLASS BOOKS DEMANDED 
DURING MARCH 3, 1968 - MAY 11, 

1968 99 

VARIANCE TEST TABLE FOR DEMANDS 

FOR P CLASS SAMPLE OF BOOKS 

DEMANDS D FROM MARCH 3, 19 68 - 

MAY 11,1968 101 

DAILY DEMAND FREQUENCIES OF TEN 

RANDOM SAMPLES OF ONE HUNDRED P 

CLASS BOOKS DEMANDED DURING 

MARCH 3, 1968 - MAY 11, 1968 106 



SUMMARY OF THE CHI SQUARED 

GOODNESS OF FIT TEST FOR A POISSON 

FIT TO THE FREQUENCIES OF DAILY 

DEMkUD FOR SAMPLES OF BOOKS 

DEMANDED DURING THE PERIOD 

MARCH 3, 1968 - MAY 11, 1968 108 



SUMMARY OF THE RESULTS OF THE 
VARIANCE TEST RUN ON SAMPLES OF 
DAILY DEMAND FREQUENCIES OF P 
CLASS BOOKS DEMANDED DURING THE 



PERIOD MARCH 3, 
MAY 11, 1968 



1968 




12 



;> 

;X 



1 10 



Table 



Page No 



8.1 COMPARISONS OF THE USE OF THE 

SAME PHILOSOPHY TITLES AT 
CHICAGO AND YALE 

8 . 2 NUMBER OF USES OF BOOKS IN 

SAMPLES IN SPECIFIED SUBJECT 
AREAS IN THE CHICAGO, YALE, 
NORTHWESTERN AND BERKELEY 
LIBRARIES DURING THE PERIOD 
1954 - 1958 

8 . 3 VALUES OF N , m , r , p , df , , 

AND P FOR THE NEGATIVE BINOMIAL 
DISTRIBUTIONS WHICH WERE FIT TO 
THE DATA FOR SPECIFIED SUBJECT 
AREAS AND INSTITUTIONS 



8. 4 



D. 1 



E. 1 



F. 1 



G . 1 



RESULTS OF VARIANCE TEST ON 
SAMPLES OF BOOK USES IN SPECIFIED 
INSTITUTIONS 



CHI SQUARED TEST TABLE FOR KNOWN 
DEMAND SAMPLE COLLECTED FOR THE 
SUNY-BUFFALO COLLECTION DURING 
THE PERIOD SEPTEMBER 1, 19 66 - 

AUGUST 31, 1967 

VARIANCE TEST TABLE FOR KNOWN 
DEMAND SAMPLE COLLECTED FOR THE 
SUNY-BUFFALO COLLECTION LURING 
THE PERIOD SEPTEMBER 1, 1966 - 

AUGUST 31, 1967 

CHI SQUARED TEST TABLE FOR P 
CLASS SAMPLE COLLECTED FOR THE 
SUNY-BUFFALO -P CLASS COLLECTION 
DURING THE PERIOD MARCH 3, 1968 - 

MAY 11, 1968 

VARIANCE TEST TABLE FOR P CLASS 
DEMAND SAMPLE COLLECTED FOR THE 
SUNY-BUFFALO P CLASS COLLECTION 
DURING THE PERIOD MARCH 3, 1968 - 

MAY 11, 1968 




x:j.i , 

13 



1 13 

1 20 

122 

124 

1 76 

178 

180 



182 



Table 

H.1 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 1 OF P 
| CLASS DEMANDS FOR THE PERIOD 

MARCH 3, 1968 - MAY 11, 1968 

fl. 2 CHI SQUARED GOODNESS OF PIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 2 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , 1 9 6 8 - MAY 11, 19 6 8 

H . 3 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 3 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

H , 4 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 4 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

H . 5 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 5 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

H , 6 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 6 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , 1 9 6 8 - MAY 11, 19 68 

H * 7 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 7 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , 1 9 6 8 - MAY 11, 19 6 8 

H.8 CHI SQUARED GOODNESS OF FIT 

TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 8 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , 1968 - MAY 11, 1968 




14 



Page No. 



184 



1 84 



1 85 



1 85 



I 86 



186 



187 



187 



xiii 



Table 



Page No 



H, 9 



H, 1 0 



X. 1 



1.2 



X , 3 



X.4 



1.5 



X . 6 



X. 7 



CHI SQUARED GOODNESS OP FIT 
TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 9 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

CHI SQUARED GOODNESS OF FIT 
TABLE FOR FITTING THE POISSON 
DISTRIBUTION TO SAMPLE 1 0 OF P 
CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

1 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

2 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

3 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

4 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

5 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

6 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 1968 

MAY 11, 1968 

VARIANCE TEST TABLE FOR SAMPLE 

7 OF THE P CLASS DEMANDS 

DURING TIIE PERIOD MARCH 3, 196 8 

MAY 11, 1968 



188 



1 88 



190 



190 



191 



191 



192 



192 



193 




xiv !5 



Table 



Page No 



X . 8 VARIANCE TEST TABLE FOR SAMPLE 

8 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3 , 1968 - 

MAY 11, 1968 

1.9 VARIANCE TEST TABLE FOR SAMPLE 

9 OF THE P CLASS DEMANDS 

DURING THE PERIOD MARCH 3, 19 68 - 

MAY 11,1968 



1.10 VARIANCE TEST TABLE FOR SAMPLE 

10 OF THE P CLASS DEMANDS 
DURING THE PERIOD MARCH 3, 1968 - 

MAY 11, 1968 

.J. 1 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
PHILOSOPHY SAMPLE AT CHICAGO WHEN 
COMPARED WITH YALE 



J. 2 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
PHILOSOPHY SAMPLE AT YALE 

J . 3 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
TEUTONIC LANGUAGES SAMPLE AT 
CHICAGO WHEN COMPARED WITH YALE 



J . 4 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
TEUTONIC LANGUAGES SAMPLE AT YALE 



J . 5 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
TEUTONIC LANGUAGES SAMPLE AT 
CHICAGO WHEN COMPARED WITH 
NORTHWESTERN 




XV 



193 



194 



194 



196 



196 



197 



197 



198 



16 



Table 



Page Ho 



J.6 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
TEUTONIC LANGUAGES SAMPLE AT 
NORTHWESTERN 198 



J.7 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 

SAMPLES OF BOOK USE FOR THE 

TEUTONIC LANGUAGES SAMPLE AT 

CHICAGO WHEN COMPARED WITH 

CALIFORNIA 199 

J , 8 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
TEUTONIC LANGUAGES SAMPLE AT 
CALIFORNIA 199 



J.9 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
BIOLOGY SAMPLE AT CHICAGO WHEN 
COMPARED WITH YALE 200 



J,10 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 

SAMPLES OF BOOK USE FOR THE 

BIOLOGY SAMPLE AT YALE 200 



J„11 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
BIOLOGY SAMPLE AT CHICAGO WHEN 
COMPARED WITH CALIFORNIA 201 

J,12 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 

SAMPLES OF BOOK USE FOR THE 

BIOLOGY SAMPLE AT CALIFORNIA 201 



J » 1 3 CHI SQUARED TABLE FOR FITTING THE 

NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
PHYSICS SAMPLE AT CHICAGO WHEN 
COMPARED WITH NORTHWESTERN 202 




xvi 



17 






Table 



Page No 



J. 1 4 



J. 15 



J. 1 6 



J , 1 7 



J, 18 

K, 1 

K.2 
K, 3 

K. 4 



CHI SQUARED TABLE FOR FITTING THE 
NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOR USE FOR THE 
PHYSICS SAMPLE AT NORTHWESTERN 

CHI SQUARED TABLE FOR FITTING THE 
NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
ECONOMICS SAMPLE AT CHICAGO WHEN 
COMPARED WITH NORTHWESTERN 

CHI SQUARED TABLE FOR FITTING THE 
NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
ECONOMICS SAMPLE AT NORTHWESTERN 

CHI SQUARED TABLE FOR FITTING THE 
NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
ECONOMICS SAMPLE AT CHICAGO WHEN 
COMPARED WITH CALIFORNIA 

CHI SQUARED TABLE FOR FITTING THE 
NEGATIVE BINOMIAL DISTRIBUTION TO 
SAMPLES OF BOOK USE FOR THE 
ECONOMICS SAMPLE AT CALIFORNIA 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHILOSOPHY SAMPLE AT 
CHICAGO WHEN COMPARED WITH YALE 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHILOSOPHY SAMPLE AT YALE 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT CHICAGO WHEN COMPARED WITH 
YALE 

VARIANCE TEST IABLB FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT YALE 



202 

203 

203 

204 

204 

206 

206 

207 



207 



Table 



Page Mo 



K. 5 

K, 6 
K. 7 

K. 3 

K. 9 

K. 1 0 
K, 1 1 

K, 1 2 

K , 1 3 

K» 1 4 

K. 1 5 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT CHICAGO WHEN COMPARED WITH 
NORTHWESTERN 208 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT NO RTHWE STERN 2 0 8 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT CHICAGO WHEN COMPARED WITH 
CALIFORNIA 209 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE 
AT CALIFORNIA 209 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE BIOLOGY SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 210 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE BIOLOGY SAMPLE AT YALE 210 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE BIOLOGY SAMPLE AT CHICAGO 
WHEN COMPARED WITH CALIFORNIA 211 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE BIOLOGY SAMPLE AT 

CALIFORNIA 211 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHYSICS SAMPLE AT CHICAGO 
WHEN COMPARED WITH NORTHWESTERN 212 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHYSICS SAMPLE AT 

NORTHWESTERN 212 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE ECONOMICS SAMPLE AT 
CHICAGO WHEN COMPARED WITH 
NORTHWESTERN 213 



O 

ERIC 



xviii 



Table 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE ECONOMICS SAMPLE AT 
NORTHWESTERN 



Page No 



K. 1 6 
K. 1 7 



K. 1 8 



2 1 3 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE ECONOMICS SAMPLE AT 
CHICAGO WHEN COMPARED WITH 

CALIFORNIA 214 

VARIANCE TEST TABLE FOR BOOK USE 

OF THE ECONOMICS SAMPLE AT 

CALIFORNIA 214 




xix 



20 



Figure 



Page No 



LIST OF FIGURES 



5,1 


Q UE S T I OWN A I RE 




81 


5.2 


SAMPLE PAGE FROM 


MASTER 






FILE LISTING 




88 


9,1 


A TWO STAGE BOOK 


USE PROCESS 


132 




xx 

2 ± 



CHAPTER 1 
INTRODUCTION 



1.1 Background 

Ever since libraries have been in existence, methods 
were used by librarians to control the availability of 
library materials so as to satisfy the demands of patrons, 
Tradit ionally , librarians have controlled the availability 
of books by setting the loan intervals or by ordering 
multiple copies of books in high demand. Circulation 
department librarians when exercising control in tills 
manner have given attention,, although probably not overtly, 
to the two most important objectives of a circulation 
department in a library. These are: (1) to have a book 

available when it is demanded and (2) to allow a patron to 
charge out a book for as long as he needs it. In establishing 
loan intervals , the librarian must and does give attention to 
these objectives. Little has been done to analytically 
determine these loan intervals. In ail cases in all 
libraries they have been subjectively, although thought- 
fully established. Analytic methods have not been used 
probably because the kinds of data which are needed for 
a thorough analytic evaluation arc not easily available 
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and because collection of data of the type needed is time 
consuming and expensive. 



The advocates of the use of comnuters in libraries 



are quick to point out that automated circulation systems 
’..’ill not only reduce patron time, b\it will also provide 
data concerning book usage, which will be useful in the 
planning functions of the library, Truoswel 1 (1964) points 



"Mechanisation of library procedure cannot 
directly improve the probability of not 
finding the book wanted, but instead merely 
makes the statistic more obvious. However, 
one of the benefits of a mechanised punched 
card control system is the fact that complete 
analyses can be made about the use of books 
and their frequency of circulation. From 
these analyses it would be possible to develop 
new operating procedures relative to charge 
periods, duplicate copies and non— circulation 
copies. These new procedures would reduce the 
probability of the user being unable to find t? o 
book he wanted, " 



In undertaking this research, this investigator 



contended that librarians and the computer specialists did 

not really know how they would use the data which were 
0 

soon to inundate them. What does it mean when book M 
circulates 1, or 5, or 25 times a year? How can the 
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librarian use this knowledge? Horse (1963, PP« 141—142) 
apparently shares this investigator's concern when he 
states : 



"It is the author's belief based on discouraging 
experience , that neither the cornu tor exports 
nor the librarian (for different reasons) 
really knovr ’.’hat data would 
be useful for the librarian to have 
collected, analysed, and displayed, so ho can 
make decisions with some knowledge of what the 
decision implies. Hhat is needed before the 
computer designs are frozen is for models, 
of the sort developed in this boob, to bo 
played with, to see which of them could be 
useful and to see what data are needed and in 
what form, in order that both models and 
computers can be used most effectively by 
the librarian. 

If it is so designed, the computer not only can 
gather the needed data but also can efficiently 
help to carry out the operating policy, once 
decided on .... Again what is needed is 
some experimenting with models of operation 
of the sort discussed in this book, so as to 
insure that the computer will utilize, as 
completely as possible, the data it gathers 
and stores and also to prevent the system 
from be ing overdesigned to do things that 
turn out to be inutile or vacuous." 



One of the most important variables that must be 
predicted before analytically derived decision rules 
concerning library circulation problems could ho formulated 



is the demand for a hook 



A knowledge of how many timer; 



a book is going to bo demanded is most important in the 
establishing of decision rules for library circulation 
systems. Predicting book circulation or book use is not 
sufficient for determining circulation department, policy. 

One must be able to predict or estimate demand. It is not 
unlike the sales problem in which future sales are predicted 
on the basis of past sales and not on demand# possibly 
resulting in a significant number of dissatisfied or lost 
cus tomers , 

1.2 Definition of a Demand for a book 



For the purposes of this research, a demand for a 
specific book is an expressed or unexpressed need for a 
book once it has been identified as being available in the 
library , 



An expressed need is one in which the library system 
knows that the need exists either because the patron found 
the book he wanted and charged it out or did not find the 
book he wanted and asked that it be saved or located. 
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An unexpressed need is one in which the library 



system does not know that the need exists either because 
the patron found the book he wanted, used the book in the 
library but did not charge it out, or one in which the 
patron did not find the book in the library but did not 
indicate that the book should be saved or found. 

A book found by browsing becomes a demand once a 
book is found which the patron will use. An unsatisfied 
need of a browser then doss not satisfy the definition of 
demand made above. 

There are also demands for books which are not in 
the collection of the library, such as those books for which 
no cards are found in the card catalog. This type 
demand does not meet the definition of demand given above. 

1,3 Distinction between Book Demand , Book Use and Book 

Circulation 

When a patron has identified that a book he needs 
is in the library and attempts to get the book , a demand 
lias occurred. If lie finds the book and uses it in the 
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library or if he charges out the book for hone use, we say 
that a book use has occurred. The latter type use is called 
a circulation. The literature is confusing as to what 
actually has been studied in particular investigations. 

This research reported here is a study of book demand vrhich 
includes book use and circulations as well as unsatisfied 
demand. 

1 . 4 Definitions of Terms 

1 , A loan interval As the interval of time 

for which a library patron may borrow a book from a library. 

2 , A clos ed stack library is one in which access to 
the collection is not available to the patron (no browsing) . 

3, An open stack library is one in which access to 
the collection is available to the patron (browsing allowed) . 

4, The demand for a library book is an expressed 
(known to the library system) or unexpressed 
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(unknown to the library system) need for a book which has 
been identified as being in the library's collection. 

5. A satisfied demand for a book occurs when the 
patron finds the book which he seeks. 



6. An unsa t isf led demand occurs when a patron does 
not find the book which he seeks. 

7. A book use occurs when a demand is satisfied and 
the patron uses the book in the library or charges it out. 



8 , A book circulation occurs when a demand is 
satisfied and the patron charges out the book. 



D. A known demand is one which becomes known to the 
library system either because the patron charges out the book 
does not find the book but asks the circulation department 
staff to find it, or finds that the book is already charged 
out and asks that the book foe saved when it is returned. 



in 



An unknown demand occurs when the patron does 



not find the book he seeks, does not ask the circulation staff 
to find it, and does not ask that it be saved, or else the 
patron finds the book he seeks and uses it in the library. 



1 . 5 ob j ectives of the Research 



This investigation is a study of book demand and was 
conducted in order to: 

1. Provide evidence that one cannot reject the 
hypothesis that both the known and unknown demand for books 
is Poisson distributed. 

2 . Develop a practical model to predict total and 
unknown demand based on known demand data readily available 
in library circulation systems. 

1.6 Approach to the Problem 

In order to fulfill Objective 1, book demand data 
were collected to verify the hypothesis that the number of 
demands for individual books is Poisson distributed 
cannot be rejected. This was done by randomly selecting 




books from the entire SUNY/Buf f alo Library collection and 
collecting data concerning the number of times each book 
was charged out of the library over a period of time. 

In addition, for all books i,t the P Class (Language 
and Literature) collection all known and unknown demands 
were recorded from February 13, 1968 through May 11, 1968, 

Random samples of P Class books were then selected for 
analysis . 

With both sets of demand data, various standard 
distributions were fit to the number of demands for 
samples of books. Ultimately, the Negative Binomial 
distribution was fit to the data. This indicated that 
the demands for individual books are Poisson distributed, 
but that the mean demand rate varies from book to book, 
according to a Gamma distribution. The Poisson hypothesis 
was further verified by the data on the number of demands 
per day of random samples of books, and showed that the 
number of demands per day was Poisson distributed. 



Further evidence is given to permit non-rejection of the 
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Poisson hypothesis by the analysis of 18 samples of data 
presented by Fussier and Simon (1961) concerning book use 
in various subject areas at the libraries at Yale 
University, Northwestern University, the University of 
California at Berkeley, and the University of Chicago. 

In order to fulfill Objective 2, the knowledge that 
demands for individual books is Poisson distributed was 
used to develop a model with which the number of demands 
for a book could be predicted, from information about 
known demand, specifically, the number of times the book 
circulates over a period of time. At this point, it was 
demonstrated that the known demand information could provide 
insights as to the future availability, or unavilability , 
of the book in the library. 



1.7 Conventions Employed in the Report 

The decimal numbering system was used to number 
Chapters, Sections, and Subsections. Appendices are 
distinguished from the body of the report by using capital 
letters. when appropriate, equations are numbered 
consecutively throughout each chapter. The equation 




10 

3l 



numbers are enclosed within brackets. Tables are numbered 
consecutively throughout each chapter and appendix. The 
decimal numbering system is also employed here. 

References are always made to the smallest subsection. 
Thus, a reference to a section implies a reference to 
the appropriate subsection. 





CHAPTER 2 



REVIEW OF RELATED RFFEAP.CTI 

Studies of he use of books , journals and facilities 
in libraries are abundant within the literature. DeWeese 
(1967), for example, published a bibliography of 547 use 
studies found in the literature. In hin thesis, Jain 
(1967) indicates one of his purposes is to catalog the 
important history of statistical analysis and mathematical 
modeling of library usage. He listed 84 references and 
provided detailed appendices so that "in the future, 
research workers would only have to consult his thesis for 
early (up to 1965) references and their summaries," 

Although there are many reported studies of book 
usage models, there are no reported studies of booh demand. 
There is, however , one mathematical prediction model 
developed by Horse in a recently published highly 
significant book which attempts to apply the analytic 
method of operations research-systems analysis to the 
operating problems in the Library (Morse, 1968) « The model 
will be explained later in this section. 

It is the author's purpose here to review the writings 
believed to be relevant to the development of models to 
predict demand even though most of the following studies 
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relate to t>e prediction of book usage and book 
circulations. This review does not cover journal usage 
or journal circulations, 

2 , 1 The First Book Use Study 



The earliest published use study was conducted by 
Ranch (1911) in the Grand Rapids Public Library, His 
study, as do most other use studies, provides breakdowns 
of the used books in a library by various characteristics 
such as class, grade average of students, etc, 

2,2 The Fussier and Simon Data at the University 
of Chicago Libra y 

The first truly large and thorough study of boo): 
usage was made by Fussier and Simon (19P1) at the University 
of Chicago. The fundamental question to which they directed 
their attention was: 
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"Will any kind of statistical procedure 
predict with reasonable accuracy, the 
frequencies with which groups of books 
with defined characteristics are likely 
to be used in a research library?" 

Their study was based on certain assumptions, some 
of which were later examined empirically. These assumption 
were: (1) The recorded circulation use of books is a 

reasonably reliable index of all use, including browsing 
use, within the library. (2) There are some patterns in 
the use of books common to major research libraries. 

(3) Within homogeneous subject areas and types of books 
(i,e, , monographs, serials), use is a suitable initial 
criterion for the segregation of materials into different 
levels of accessibility. (4) Economic factors make it 
highly desirable to segregate books, on the basis of their 
value and use, into two or more levels of accessibility. 
Fussier and Simon used use data on a sample of 600 books 
(400 selected using a systematic sampling plan and 200 
using a stratified random sampling plan) . With these data, 
they studied 22 functional relationships using "use" as 
a dependent variable and various independent variables. 

The 22 functions fell into three broad categories i 
(1) Functions for libraries with no record of prior use. 



( 2 ) Functions that require five-year past use records. 

(35 Functions employing long records of past use. Fussier 
and Fimon conclude that: 



"By far the best predictor of the 
future use of a title is its past use. 
Because of the low probability of 
use in any one year for titles in the 
marginal-value range of titles in a library 
the size of the University of Chicago 
Library, a fifteen or twenty year long 
observation period produces considerably 
better results than will an observation 
period of five years. 

Some research libraries have no 
records of past use. If these libraries 
wish to commence storage immediately our 
results also suggest the best 
possible functions and suggest to them 
the extent of the errors that will arise. 
Our results also suggest the wisdom 
of postponing storage for perhaps five years 
and collecting records of the use 
during that time. If a library does wish to 
initiate a storage plan without waiting 
to collect such records, it might well 
consider setting up a system by which 
high— use books that were sent to storage 
could easily be restored to the working 
collection, " 



Fussier and Simon also point out that their assumption 
of a stochastic model does not suggest that the Poisson 
distribution will approximate the distribution of books 
within a library (or within a given subject area) by the 





frequency of their use during some period of time. They 
say that their observed distributions do not resemble the 
Poisson closely, having a much higher variance. There are 
too many observations at "zero" and at multiple use points. 
Chapter B of the research reported here is devoted to an 
analysis of some of the Fussier and Simon data. 



2 . 3 Deterministic Pook Use M odels 



Most of the reported mathematical models of bool', use 
have been deterministic. The first mathematical model was 
reported by Shaffer and Ernst (1954). Their model was 
developed from a non— random sample of 237 books which were 
borrowed from the M. I.T. Science Library during the period 
May 5 - 10, 1954, After attempting to fit various models, 
they found that the average annual circulation I(t) of a 



t year old book was s 




-o.g-t -o.oist 

10 jL + 1. 1 jl> 



Rothkopf (1962) using a non random sample of 97 books 
which were at least three years old attempted to fit various 
models to the data and considered that a good fit was 
obtained, in which C(t) , the average annual circulation of the 
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book during the first t years in the library, was related 
to G(3), the average circulation of a book during the first 
three years in the library by the model i 



Dawson, Aldrin and Gould (1962) using a sample of 
202 books developed a me lei based on two assumptions. 

(1) Within a homogeneous class of books, there is a 
maximum value of initial circulation rate. The most popular 
books in that class will tend to achieve that rate, r. 

(2) A particular book will eventually decrease in 
popularity and the circulation rate will fall off inversely 
with time. They fit various models and found the best fit 
which would allow the determination of C(t), the cumulative 
circulation of a book during the first t years in the library, 
to be : 



Leimkuhler (1966) proposed a circulation model with 
a constant rate of obsolescence in which the average manual 
circulation rate for books, C(t), whose age is t, can be 
determined using C ft , 



c co 




C CO 



c co - 5.12 ( i + _L_ ) 

o m 



the average annual circulation rate 



in the first year of acquisition (C^h) and E a constant 

rate of obsolescence (0 < E < 1) using the following model: 

= c 0 (i - 83 * 

2,4 Probabilistic Models of Book Use 



Dawson (1962) developed a Markov model for predicting 
the future circulation of a book on the basis of its 
immediate past circulation. His model was developed from 
a sample of 305 hooks. Ills states were 0 circulations per 
year , 1 or 2 circulations per year and 3 or more 

circulations per year. He tried several functions and 
found the following to he close approximations to his 
observed transition probabilities: 
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Morse (1963, 1965, 196R) has suggested a Markov model 



which has been fit to circulation data of certain classes 
of books at the Massachusetts Institute of Technology, For 
all books having a mean circulation m. in a given year, the 
mean circulation n for the subsequent year given by his 
model is: 

51 — a + bn 

where a is the "residual circulation" after time has removed 
the initial popularity and b * rate of diminution of 
circulation with time (0 < b < 1). Morse says that the 
actual circulations in the subsequent year are clustered 
about the expected value in accord with the Poisson 
distribution. Morse suggests that a Markov model is 
appropriate in which the transition probability (P« t *!.) can 
be found by: 

- ( s 



? , 5 The Jain Study at Purdue University 

Since Morse (1963, 1965) suggested that the uses 
of a book followed a Poisson distribution, Jain (1967) 
examined the frequencies of uses of homogeneous groups of 
books from the Chemistry, Physics and Pharmacy Libraries 





at Purdue University and found that the "zero use" class 
does not follow the same probability law as the remaining 
classes. Jain, because of his findings with the "zero use 
class of books went on to develop a model by splitting the 
probability that the book will be used into two components 
(1) the probability of a book not being used and (r») the 
probability of uses if the book is used. He called the 
latter the "Pn Model," lie proposed a truncated Poisson 
distribution as a possible fit to observed frequency 
distributions of book uses. He let = the number of 
uses of a given book during the nth year after its 
acquisition in the library and then; 



where 1 and ^>0 and where, in general , 

X-^f- - log p lie then studied the probability that 




X™ <0, p^, and developed his "pn function, " 





o 



if n * 0 
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Where /S , T are parameters which depend or the clans of bool's 
of which the booh under consideration is a rrer.be r arc! n 
is the year after acquisition. 

Jain (19f. 6) previously had developed a concept which 
he called "relative use" which he used in testing his Pr, 
model and his pn function after he estimated the required 
parameters • Jain developed his method of relative use 
because, he felt, that the two methods previously used to 
study samples of book usage data were inadequate, namely 
the collection method in which random samples (or samples) 
of the entire collection were identified through the shelf 
list and then studied, and the cheek out sample method in 
which books checked out for a period of time were studied . 
'The first method is criticized because it is difficult to 
collect data and because there are problems of missing 
data and of lack of control on the methods of recording 
usage histories in the past, 'The latter method is 
criticized primarily because one cannot draw inferences 
about the entire collection. Because of these deficiencies, 
Jain developed his method of relative use in which three 
independent samples of monograph titles am obtained from 
the total collection (P) 
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the horr? used rrotnr jr.l (I!) and 



the in- library used material (I) • These samples are divided 
into a certain number of groups on the basis of the 
following characteristics of the title: language, country 

of publication, year of publication and year of accession. 
Using as the number of titles in sample 8 which belong 
to the ith group, II ^ as the number of titles in sample V 
which belong to the ith group, and as the number of 
titles in sample I which belong to the ith group, he defined 
relative use R as follows; 

"i + n 

R; - relative use of the ith group = (100) 

As a further illustration of how the relative use 
method cou3d be used to estimate the percentage of 
monographs used in the ith group, Jain presented the 
following using as an example only the checkouts for home 
use ; 



Let ; 



m - number of groups of monographs. 





S- - number of monographs in the 
L ith group in the shelf list sample. 



II- = number of monographs in the 
L ith group in the 19 CP Spring check- 
out sample, i - 1, 2, m. 



h- = number of monographs which are 
included the ith group of 1966 
Spring check out sample and the ith 
group of the shelf list sample, 
i * 1,2, • • • • , m • 



then, relative use for the ith group of monographs is : 



while the collection method estimate of the percentage of 
monographs used in the ith group is; 



If f » fraction of the total collection included in the 
shelf list sample S, then the estimate of the percentage 
of monographs used in the ith group based on relative use 




I! 



( 100 ) 



h 



( 100 ) 
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Jain points out that it is not clear from the above 
that the relative use method of getting the estimate is 
superior to the collection method* but he states that if 
the fraction of the total collection is small or if the 
period for which usage data is recorded is short then it 
seems that the ratio: 




expected value of h^ 



will fluctuate widely around 1 for a given value of F ( - . 

In this case, the relative use estimates are expected to 
do much better than the collection method estimate, 

Jain compared the collection method estimate with 
the relative use estimate and the actual number of books 
used in selected subsets of a sample of books. Relative 
use gave values closer to the true value of use* Jain 
tested his Pn model and pn function by comparing estimates 
from the model with estimates of the relative use. 

After estimating the parameters in his pn function, 
he fit the function to his relative use estimates for five 
subgroups of titles from his shelf list sample. The 
subgroups ranged in size from 22 — 67. lie used the method 



of least squares* am? the fit was poor (P . 2 from .IT to . r * ) - 
Ho also fit the pn function to four suhcrowps frcn the 
chech out snnple and found n* values of . - ? 1 '. He then 
fit the function to three subgroups of the in-library use 
sarple and had poorer results than the hone use sample but 
better results than the shelf list sanple. Using the same 
subgroups he fit his Pn model, a generalised exponential 
model, a generalized, logarithmic model, ard the Horse 
model. Ho concluded that each of the models does equally 
well for predicting Pn, For certain subgroups, ho showed 
that all models were equally poor, 

Jain concludes that the pn function (no use) is quite 
off icier, 1 " in describing the probability of no use of a 
book for various ages and that the Pn Model does as well 
as the three other book usage models when fitted to the 
data from Purdue Libraries. He further concludes that age 
is a significant variable in studying the usage of 
monographs . lie points out that while usage rates of 
individual monographs have considerable variation even 
over a short period of time, the usage rates of various 
age groups do not show any significant difference over 
time. 
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2 . r, 



The Morse Book Demand Model 



Morse (1968, pp, 124-192} develops a model to estimate 
unsatisfied demand for a hook. Earlier calling the number 
of uses per period of time the service rate of the booS , /x, 
he shows that the expected circulation, R t , is? 

A 

JX +■ X 



where X is the demand rate, 
demand isi 



The expected unsatisfied 



u 



7L - l 



R 



X 



Horse using the conditional probability relationships 

PCnl/O PC A} * P(XlH) Pni 

PCX ) - p ( A , 7*1 )p ( X ) 

where P(A,m) = P(xn,A) is the joint probability that a book 
has a demand A and also circulates m times a year, Morse 
uses this relationship and goes from a formula giving 
expected circulation in terms of demand to one giving 
estimated demand in terms of known (or predicted) 
circulation. He develops the model because: 



"Some of the things we would like to know, 
for each homogeneous class of books, are : 
the expected value of the demand for those 
books of a class that circulate m times a 
year; the mean value o = A of the per book 
demand, averaged over all bool's of the class; 
and thus, by subtraction, the mean unsatisfied 
demand U = D — R per book for al the books ir. 
the class, k7e would like to know how the 
unsatisfied demand would change in value 
if some of the books were duplicated of if 
the loan period 1/ax. were changed in value," 



Morse points out that the calculations aren’t easy 



Decause A is not a defi.nite number, the exact number of 
y’ould-be borrowers who come to the library for a given 
■.©ok during the year. It is the expected demand, lie 
joints out that since A is a continuous variable, P(A|?nj 
and P ( A ) are probability densities. Secondly, he points 
>ut, the conditional probability P (m / A ) is a quite 
complicated function and that it is not a bad approximation 
:o use the Poiss - " ^ ~ * 




1 



(: 



M-+ A 



M.% 




Uhere : 



u /t 



is the expected circulation 
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He points out that this approximation is better than some 
made later in his development,, 



Morse points out that there is no way of measuring 
the distribution of demand P (/O for a class of books. lie 
states that it is possible to determine P (m) however, and 
using data collected in three different samples of B76, 
204 and 97 titles in the collection of the Library of 
Science at MIT, found that the distribution is roughly 
geometric, though a semi- log plot showed a slight downward 
curvature, rather than a straight line, ITe points out 
that P(m) is proportional to TT 7 ** but that P(0) is not 
1 -T, The straight line does not extend back to P(0). 

Morse states that since the knowledge of P(mJl.) and 
P(m) alone does not enable one to find P()l) and P(Ajm), 
P(A) has to be guessed and then the guess verified 
indirectly, Morse assumes that P(/t) is exponential since 
it is an extension of the geometric distribution to the 
continuous variable X . He tried : 

poo = (rr) / pooou-i 




as the probability that the demand for a randomly chosen 
book of the class is beti-zeen X- and X + i . Then: 

D = f \ p u ) dx 

and since i 

oO 

p / P (->n!x) P (^)oLX 

o 



substituting: 



p Cm) = 



1 

n ( .d 



00 t , m 

/ Xx \ 

P + ~K J 





Morse states that numerically evaluating the integral for 
various values of m, \ and D produced a distribution 
P (n) which is nearly geometric when plotted against m on 
semi-log paper. Thus, he approximated P (m) with a geometric 
distribution designed to produce the same mean circulation 
as does the formula shown above and data collected by 
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Pourny (1962). Morse found a modified geometric 

a 



distribution to fit: 

P Cm) - 



AncC 



w - 



L 

3 

CX + 1 



Q + L 



■for 'm. > 0 



P Co) = 

a + i 

p (> Tvi) i p( not ) t P ( 9X + 0 • • 

Gt 1 ( Gl 

a+i. 



D * 1 



where Q is set so that 



Gl C Ql + O 

D l 




zpuy+spci) 



or 



1 



1 2 . 



1 

£ 



a -- r (D* 0-* 



Constant Q is the parameter of the modified geometric 
distribution , 

Morse points out that the; value of the mean 
circulation R can be calculated from the circulation 
distribution 



but that the individual P (m) cannot. He does this by using 




the eerie 






ERIC 




/ 

/ 



where E^x) is the exponential integral of x. 



He gives the approximate formulas for the above as 

1.87 



D 



R 



1. 11 



D 



\ 



JA. 



j <JL 



2.5 



R N 2 ' OL * 



KylL 



Thus given R and ja> , D can he calculated and once D is 
known, Q can be calculated. Once 0 is known, P (m) can be 



calculated , 



Morse points out that D - R is the mean value of the 
unsatisfied demand per book of the class, since D is the 
estimated value of the mean demand. 



Morse states that his choice of a simple model and 
the approximations he has made may result in values of D 
being off by 10 or even 30 percent but that, this does not 
matter for the needs because one is generally interested 
in whether D is 2 or 200 and in whether some action 
increases or decreases D *• "R by factors of 2 or more, 
formulas, he states are adequate for such purposes. 
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CHAPTER 3 



THE negative binomial distribution 

The negative binomial distribution has been used in 
many diverse experimental situations. One of the oldest 
applications was made by Greenwood and Yule (1 9?9 ) . They 
found that the number of industrial accidents among workers 
during relatively short periods of time may follow a 
negative binomial distribution. Apparently, the 
distribution over periods of equal lengths of time and for 
the ith worker would be Poisson, but accident proneness 
from worker to worker varies in such a way that the numbers 
of accidents for a group of workers may follow a negative 
binomial distribution. It has also been found that counts 
of insects on experimental plots often conform to a negative 
binomial distribution, presumably because their 
gregariousness or hatching habits introduce " contagion" 
in tlie sense that the location of one insect in a plot 
increases the probability that there are others in the 
plot. 

Kendall (1949) reports that when the birth and death 
rates per individual within a living population are 
constant, and there is a constant re ’ e of immigration. 




SB 



population size may conform to a negative binomial 



The nu. bers of pinholes in 50-foot samples of 
enameled wire have been found by Wise (1946) to con f c .in 
to a negative binomial distribution. Presumably, the rare 
event of a pinhole in such a wire has a changing probability 
from batch to batch. Feller (1957, p. 270) gives an example 
from records of damage caused by lightning as observed 
during n time intervals of length t. In this situation, 
it seems that lightning strikes rarely enough to make a 
Poisson distribution applicable but the damage caused 
varies from strike to strike in such a way that the total 
result is well fitted by a negative binomial distribution. 

3.1 Derivation of the Negative Binomial Distribution 

The negative binomial distribution can be derived 
in several ways. One way produces the negative binomial 
distribution in a waiting time experiment under a 
binomial sampling situation. The number of trials, n, 
are usually fixed and the number of eve’- '-.s f x, are 
observed. In contrast, in a negative J^inomial sampling 



situation , the number of occurrences of the event, r, are 



fixed and the number of failures, x, ( >: - n - r ) , is the 
random variable* The total number of trials then is x + r,. 
The trials always end with the rth occurrence of a success. 
For example, when considering the probability that 3 
occurrences of a tail (failure) will occur before the 
occurrence of a head (success) in a coin tossing experiment 
the probability is; 



If r = 2 and x = 3 , then by the binomial probability 
function , 



and the probability that the 5th toss would result 
in a head is 1/2, therefore: 



p {ttt J = ( j ) 3 



The probability of a success on the fourth toss is: 




Then 




P {3T,in} « ( J ) ( I ) 3 ( I ) 
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In General/ if 



p = probability of a success 
q — probability of a failure 
r - the number of successes 



The probability of r - 1 successes in n - 1 trials 

0:^f r V' r 



and the probability of the rth success on the nth trial 

fn-i ) „ " * «- r „> r 



is : 






r-lJP t 



o 



r(< 



[3.1] 



Another development of the negative binomial 

distribution follows. Assume that death occurs after the 

rth attack of a poison. The probability that an individual 

survives n exposures is the probability that he will be 

attacked 0, 1, , r-1 times during n exposures is: 

r -1 



r— fn \ J 

(j ) i F 

j,0 



Z 



Consider n-1 exposures, death occurs after r attacks 
the probability of surviving n — 1 exposures is; 

r-L 



T 



n-t 

J 



*“i'j j 

1 T 
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The probability of dying during the nth exposure is 
the probability of surviving n - 1 exposures minus the 
probability of surviving n exposures: 




Equation [3,2) above is equal to: 

r-L 
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Factoring out p 
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Since p = 1 - q and 1/q 

r ' 1 n-j-t 

t 



1 “ (1 * q)/q - p/q* [3.2] becomes 




n-L 

i 



'n~i\ n-j j 

i-ijf ? 



3 : 



Putting successively j - 0, 1, ... , r-1 in the 

equation, the positive term of each value of j is cancelled 
by the negative term at the next value of j . Only the 
positive terms at j * r - 1 remain and the probability of 
death during the nth exposure reduces to : 



’^-1 



r . 1 )? t 



r 7pr 



for r{ > r 



0 



for nr^C r 



[3.3] 



This form of the negative binomial distribution is 
called the Pascal or Polya distribution. 

A more specialized form of the negative binomial 
(which generalizes the Poisson) can be developed by 
considering a random variable, x, which follows the Poisson 
distribution but with its parameter regarded as a random 
variable independently following a particular continuous 
probability distribution. This approach was developed by 
Greenwood and Yule (1920) . They said that while for one 




verson the probabj lity of / 1 # ? » •••••» accidents r ay 

follow a Poisson distribution with pararetor A *. 

^ ^ X 

f(*). ± - — *’0,1,2,- 

X' 

[3.4] 

^ nionn nurber , X # of the accidents ray vary fror person 
to person. "he particular probability distribution proposed 
bv Greenwood and Yule was the Ganna (or Pearson Type TXT) , 
whose continuous density function is : 
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-c^. r-1 

Jb X 



[3.5] 

Then the joint density function of x and X is the product 



of [3.4] 



Pine© 



[1.5]. Thus 



h ( %, \ ) * -JT JL 

tr-l') ! 



C r . -ea r-L jlT*7? 



X! 



r (r) 
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d z 



Therefore : 

h (x,X) 



r r -a.(c+i) - X+r-t 

Jb * 

XUr-i>! 
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a 1 

0 



The density function of x is obtained by integrating 
on A. over its range, zero to infinity: 



g 



C 



^ CC+l) x + r -t 

A. d X 



Let : 



C r 

X ( (Tr-i) 1 




r+.r-t -aCe+O 
X jl cLX 



Z 

ol 2, 
cl X 

X 



- X ( C + 1) 

= (c+ i ) cL A 

- oC B 

"c + 1 

= 2. 

~cTT 



Then : 

g c%) 



2 



XlCr-iV. 



C + i 



X+r-i 

JL 



C •+ i 
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W tr J 
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cC Z 



( X * r - i ) ! 
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ZlCc- 1>‘ cc+i^ 



jC + r 





r?.n 



Since i 

rCxtr) = ( x + r-t ) ! 



6 Q 

/ * 



% + r - 1 -a 
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pCx) 
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X'.Cr-t) 1 . kC+ L / Vc + i/ 



This is another form of the negative binomial distribution 



Letting x = 
probability 



n - r and p 
of k is z 




- [C/ (0 4* 1 ) ] , 



r i 



n-r 



q is 1/CC + 1 ) , the 



[3.7] 



[3,7] is identical to L3.1] and [3.3], though now r is not 
integer. 



Frequently, C3.fi] is developed with the following 



? = i 

a 1 c * 1 
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r» * and n* are not probabilities. Using these parameters. 



the negative binomial becomes t 



( n-)- 

i, r -L 



1' 



t3.S] 



There cr 1 



1 + p* and r is positive, T negative binomial 
cats its name because successive terms in the erpar.sior 



of 



J C C 



V 



are the probabilities P.73 and successive terms in the 
or.nansion of > , , . -m 

( q. - p -> 

are the probabilities [1.7] . 

3.2 Parameters of the retrative Pinonlal Distribution 

mho parameters of the negative binomial [7.7] are 
p and r whereas the parameters of P.7] are p ' and r. The 
Gamma distribution used in the development off the negative 
binomial distribution has parameters C (scale) and r (shape 
The Poisson distribution has the parameter, % , 






3 . 3 Mean and Variance of the Negative Binomial Distribution 



The probability generating 
binomial distribution [3.7] is t 

r 



function for the negative 



P ( 1 - c£ S ) 



[3.9] 
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And , since 
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And , 



P'CD 



r <t 



’herefor© : 



Var Cx) 



r <t t r'z + r<j. 
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[3.11] 



For the alternate form of the negative binomial 



[3,9] , the 


mean 


and 


variance 


generating 


function 
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And, P " ($) ~ _ C''ti)p>(q_'-p's) r 2 (-p') 

P" CO (r»0 r C P ') 2 

Therefore , 

Var(x)= rp't(ru)(p ' )V - r * fp ' )* 

- rp* Cl +p‘) 

Mar Cx) - rj '^ 1 

[ 3 . 13 ] 

i . i 

Stnce c£ = 1 + y 



3.4 



Estimation of the Parameters of the Negative Binomial 
Distribution 



Three methods have been used to estimate parameters 
of the negative binomial distribution! The Method of 
Moments, The Maximum Likelihood Method, and the Proportion 
of Zeros Method. 
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3,4,1 The Method of M oments 



The paranuters have been shown to be p and r and the 



first two moments are estimated from the sample moments. 



then the ratio of the mean to the variance provides an 
estimate of p. That is, if the mean of the sample is m and 



m = rq/ p and noting that g *» 1 — p, an estimate of r is 



r by this method has been derived by Fisher (1941) and 
it's reciprocal is given by: 



3.4,2 The Method of Maximum Likelihood 

The maximum likelihood method of fitting the negative 
binomial distribution has been discussed by Haldane (1941), 
An s combe (1950) and Siohel (1951), Let f^ be the observed 
frequency of n events and 7 . the highest value of n observed. 
The total number of observations, N, is given by: 



mean and variance were shown to be 




If the 



the variance is , then m/S* estimates p, Fince, 



given by r - mpyf 1 - p) « The efficiency of estimating p and 
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The mean of the sample, m, is given by j 

L 



' N 



jL. 



Tin 



i sc 

The likelihood is given 



IT 



P Cn ") 



yn. 

J 



ii 



jid the logarithm of the likelihood, L, is given 



L 




*£ ^ loG v 



£ *"* 
A o 



ff 



]o R J> + J ryj y log ( r-f-X ) 

l %=0 

i- ?2 ^°g ^ 



+ rlog p 



J 



Differentiating with respect to p: 

r 72 

T ' d-p> ' 

«* J 

ZZZ3L, 

j C i -p) 

Equating to zero and substituting m for; 
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and q for 1 - p, gives: 



yyi 



rc L 

F 



[3,101 



The mean of the sample, m, is therefore the maximum 
likelihood estimator of the distribution mean. 



Differentiating L with respect to r, therg 



b r 
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r *dr i 
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(X+T) 



+ JO 



Equating to zero gives: 

*»l-l 



^1 



S-o 



(XI- r) 



N log 



Eliminating p from the equation by solving [3.141 for p 
yields : 

£ j. r j. j* , f 

?? 



n % a + s) 



fi + f» + f, ♦ .*f 



f, + f- 



t 



n. 



f 



n 



( r *■ 1 ) 



( r +■ % - 1) 



0 




The solution of this equation for other than r » infinity 
provides an estimate of r. An iterative process must be 
used by guessing a value for r and iteratively adjusting 
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r. The? estimate obtained by the method of moments is a 
good initial guess. The use of computers has greatly 
improved this iterative process. This method of maximum 
likelihood was used to estimate the parameters of the 
negative binomial distribution in this research. 



3,4,3 The Proportion of Zeros Method 



A third method of estimating r is from the proportion 
of zeros. If £ 0 is the observed number of zeros, then an 
estimate of r (Anscornhe, 1950) is; 

= ( 1 + 

r 



& 

N 



Taking 

yields 



of both sides and putting; 

a = - J °S. fo 

N 

a. = ]<» ( 1 + 23. ) 
r & r 

This equation can also be solved in an iterative 
manner. 





CHAPTER 4 



f 



STATISTICAL TECHNIQUES USED IN THE ANALYSIS OF DATA 
Since extensive use has been made of certain 
statistical techniques, this chapter will explain the 
rationale and subtleties of applying the techniques to the 
data used in this research. 



4.1 Estimating Parameters of the Negative 
Binomial Distribution 



In order to calculate the theoretical frequencies 
of occurrence of events in the negative binomial 
distribution, it is necessary to estimate the parameters 
r and p of the distribution; 



£ CO 



U-i 



n-r* 

-f % 



C 




Using the method of maximum likelihood, it has been 
shown in Paction 3.4 that an estimate of the mean (m) of 
the distribution can be determined by; 



n 



Y 



In Section 3.3, it was shown that the mean of the 
negative binomial distribution was given by; 




49 



a nr’ the .variance by: 



Var (%)- 



I’y takincr the ratio of the rean to the variance of the 
sample data# one can obtain an estimate for the parameter 
p. Thus: 



7 



tie an 



Variance 



r <L 

T 



In order to obtain the estimate of the parameter r 

by the method of maximum likelihood an iterative procedure 

must be used. In Section 3 * fl , it was shown that, the 

parameter r could be determined from the equation : 

N (J.+ 2S) - f | •» f a + f 3 +1^ 



Cr + L> 



ffl.11 



0 






There : 

r ** the parameter 
N = the sample r , i no 

f* — the frequency of occmrrcr-co of 
m = the sample moan 
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The procedure, then, is to guess at an initial value of 
r and solve the above equation, then successively alter 
r until the value of the left hand side of the above 
equation becomes zero. It is sometimes judicious to use 
the value of r estimated using the method of moments 
(Section 3,4), An illustration of the procedure of 
estimating p and r is given below for the following booh 
use data. 



No, of Uses 



Frequency 



0 

1 

2 

3 

4 

5 



85 

21 

12 

3 

4 

7 



The mean of the above data is .79545 and the variance 
is 1,91967, By taking the ratio of the mean to the 

i 

variance, it is found that the parameter p =. ,41437. 

Using r « .400, the value of the left hand side of 
the equation is 2.21559. The estimate of r is then 
increased by ,001 until the value of the left hand side 
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of equation 4,. 1 becomes negative. Table 4.1 shoves the 
tabular values of the numeric value of the left hand side 
of the equation. 

Since the numeric value of the left hand side of the 
equation at r = .427 is .00841 and the numeric value is 
-.00435 for r = .428, we could use linear interpolation 
to determine the r for which the le hand side of the 
equation is zero. This method was used in all analyses 
in which the parameters of the negative binomial 
distribution had to be estimated. 

4,2 Using the Chi Square d Test of Goodness of Fit 

The Chi Squared test is the oldest goodness of fit 
test, A random sample of size n is obtained from the 
population and f • denotes the frequency of x^ in the sample 
If the population is discrete and has the probability 
function, x i}* the expected value, e£ equals 

npj^x^Jt £ ^ will not usually be equal to np^x- J and the 
differences - np|x^ 1 are related to the extent of 
departure of the actual population probabilities frompj^i,]’ 



tabulation of 
n, TTB 



TABLE 4 . 1 

VALUES USED 
calculation 



TO IIJ UBTBA^r 
OF r 



r nunerical valuo 



,400 2.21559 
,401 2 . 1247 !? 
,403 2,03473 



,425 
. 426 
,4 77 
,42 8 
, 4 29 



0.15573 

0.08177 

0,00841 

-,06435 

-.13653 
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The differences are squared, weighted by the reciprocal 



This statistic has approximately the Chi Squared 
distribution with k-j-1 degrees of freedom. The value of 
j being the number of parameters estimated from the sample 
data. 

For a test based on this statistic, we need to be 
able to calculate the probability of obtaining a value of 
the statistic as large as, or larger than, the observed 
value. 

To illustrate the use of this test, it is hypothesised 
that the data shown in Section 4 ,1 is negative binomially 
distributed „ The tabulation of the components of the 
negative binomial distribution with parameters p « ,41437 
and r « .427 are shown in Table 4,2. 



of the theoretical frequencies, np£x^j.,and summed yielding t 




k 






The Chi Squared statistic is 10.9543 
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The probability of obtaining a value as large or larger 
than the observed statistic is .013 (from a Chi Squared 
table with 3 degrees of freedom) . V7e conclude then that 
the data are not well fitted to a negative binomial 
distribution . 



In order to prove the quantity Chi Squared 

k a r 

( f L - np^x^J- ) / np £x-J is distributed as Chi Squared when 

tn 

the null hypothesis is true, it is necessary that the npfx^'s 



he large. That is, the proof is strictly valid when the 
np|x c J‘3 tend to infinity in the limit. Because of this 
many authors recommend that the npj^x^J-'s be greater than 
or equal to 5. Some authors recommend that the 
np £ XjJ *g be greater than 10* The point is controversial, 
however, and Cochran (1954) using nq as the expected 
frequency states that; 



"It is my opinion that these recommendations 
are too conservative and may on 
occasion result in a substantial loss 
of power in the test, I give this as an 
opinion, because not enough research has 
been done to make the situation quite clear. 
However the exact distribution of 
the £(£c - when the expectations are 

small, has been worked out in a number of 
particular cases by Sukhatme (12), Neyman 
and Pearson (3) and Cochran (4), (5), These 
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results indicate that the X 4 tables give 
an adequate approximation to the enact 
distribution ever, when sore ?%■ arc 
lower than 5." 1 



The loss of power from the greater or equal to five 
rule occurs because the rule requires grouping of 
frequencies in cells at the tails of the distribution who: 
frecp.iontly the differences between the alternative 
hypothesis and null hypothesis stand out most clearly# so 
that grouping would cover up the most serious differences 
between the two distributions* Cochran (1°5?!) uses the 
following illustration* Purpose that we have a sample of 
IT- - inn. The null hypothesis is that the data follow a 
Poisson distribution with known moan equal to 1, when in 
fact the data follows a negative binomial distribution: 

■n 



c i - p ) 



where n = 2, q = 1.5# and p * ft. 5. TThat is the chance ol 
rejecting the null hypothesis at the Sf level of 
significance? Cochran presents the data in fa ble-'ft.h. 



If an m t * as low an l.ftft is a 1 loured , 5 cel In are used 
in the Chi Cquared test with ft degrees of freed on 
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the mean is known. To make all m t - greater than S the last 
two classes are pooled and there are 3 degrees of freedom. 
To make all the m^ 1 s greater than 10, the last 2 classes 
are pooled and there are two degrees of freedom. Cochran 
calculates the values of the power function: 



which is a non-central Chi Squared distribution with 
parameter, X , of non-centrality. The larger the value 
of 2, the higher the power. The contributions to X is 
shown in the fourth column for ungrouped data; 5th column 
for grouping the frequencies of the last two cells; and, 
the last column for grouping the frequencies of the last 
three cells. The probabilities of rejecting the null 
hypothesis from non-central Chi Squared distribution tables 
are ,56 for the ungrouped data, ,43 for an m t - greater than 
or equal to 5, and .32 for an m; greater than or equal to 
10. Cochran points out that the power function slightly 
favors the ungrouped data because it is an asymptotic 
result. Cochran then calculates the sample H that would 
be needed in the grouped cases in order to have the same 
power as in the ungrouped case, lie found that an N of 130 
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would be required when the frequencies of the last two 
cells are grouped, and an N of 191 when the last three are 
grouped. These are to be compared against an 11 — inn for 
the ungrouped data. 

Cochran gives the following rule for goodness of fit 
tests for unimodal distributions: 

"Here the expectations will be small only 
at one or both tails. Group so that the 
minimum expectation at each tail is at 
least 1." 

For all uses of the Chi Squared goodness of fit test 
in this research# grouping was kept at a minimum because 
of the few number of cells and because of the loss of two 
degrees of freedom for estimating the parameters of the 
Negative Binomial distribution. Cochran's rule above was 
followed in all cases. 

4.3 The Variance Test 



One of the ways that data distributed, according to 
a negative binomial distribution is distinguished from the 
Poisson is by the magnitude of the variance. 
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distribution has a variance equal to the mean, whereas the 
negative binomial distribution always has a variance greater 
than the mean. These are contrasted with the binomial 
distribution which has a variance which is always less 
than the mean. Rejecting the null hypothesis that the 
variance is equal to the mean will permit one to conclude 
at a specified significance level that there is no reason 
to believe that the data is Poisson distributed. Cochran 
(1954) states that a comparison between the observed 
variance of the observations and the variance predicted 
from Poisson theory will frequently be more sensitive than 
the goodness of fit test. The variance test is made by 
calculating 

c-i % 

The quantity Chi Squared is referred to the Chi Squared 
tables with N-1 degrees of freedom. 

To illustrate the use of the variance test, the data 
in Section 4,1 will be used. The mean of the sample is 
,79545 and the variance is 1,91967. An alternative for 




Chi Squared, if the calculation is made from the frequency 



distribution of Xj, is : 



x 




X 



Where 

j « the cell 

% - the mean of the sample 
f j a* the frequency of occurrence of j 



The calculations for the data of Section <1.3 are tabulated 
in Table *1,4, The calculated Chi Squared is 316,1445?. The 
value of Chi Squared can be determined using the normal 
approximation to the Chi Squared distribution: 

Xf = k (h* ^ kTT )* 



Where : 

p = 1 - significance level 
Xy — the standard normal value 
at probability p 
k = sample size 
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7. xs 



1.65, k is 132, 



At a significance level oi 5% r 
and the value of Chi Squared is calculated to be 159.31. 
Thus, we would conclude that the hypothesis should not b 
accepted, that the variance is greater than the mean and 
thereby assume that the data is negative hinonially 
distributed. 
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CHAPTER 5 




THE COLLECTION OF ROOK DEMAND AND USE DATA 
Several types of demand occur for books in a. library. 
This investigator identifies two major kinds of demand* 
known and unknown . A known demand is a demand by a patron 
which the library system knows has been made. For example, 
if a person charges out a book, the library system knows 
a demand has been made. The particular book has been both 
demanded and used. An unknown demand occurs when the 
library system does not know that a demand has been made. 
For example, a patron using the card catalog has identified 
that the hook he wants is in the collection, looks for the 
book and doesn't find it, lie does not check to see if the 
book is charged out nor does he ask that the book be found 
or held for him. Previous investigators have studied only 
known demand, 

5 * 1 Types of Deman d 

Libraries generally are of two types i closed stack 
or open stack. In a closed stack library (one in which 
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a patron has no access to the library stacks) , a patron 
must identify his demand to a library employee in some 
manner. Generally, he does this by filling out some sort 
of form, for cramp lo , a book call slip. In this type 
library, all demands are known. The demands could he 
satisfied or unsatisfied. The satisfied demand can result 
in a home use in which the patron physically removes the 
book from the library for a period of time or an in-house 
use in which the natron uses the book in the library. 

This type of library can also experience unsatisfied book 
demands » Thus , the library has a record of both known and 
unknown satisfied or unsatisfied demands. 

An open stack library, on the other hand, permits 
the patrons access to the book stacks of the library. 

This type library can also experience both known and unknow 
demands. Both types of demands can be satisfied or 
unsatisfied. However, the open stack library generally 
only has a record of satisfied home use demands. The 
library system may never become aware of unsatisfied demand 
or of satisfied in-house demands. 
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Studies of book use previously conducted always 
consider known home use demands, sometimes consider known 
in-house use demands but never consider unsatisfied demands. 

5 . 2 Met hods Used For Collecting Demand Data 

Previous use studies generally involve random samo.les 
or samples of the books from selected subsets of the entire 
collection. Two methods for collecting data will be 
described using the terminology used by Jain (1967). 

A "check out sample" is one in which the past use 
(generally satisfied home use demands) of books "checked 
out" during a specified period of time are recorded. 

Dawson, Aldrin and Gould (1961) used this type of sample. 

The second method is called a "collection sample." 

This method involves choosing a sample of the collection 
of books in the library or of the collection of the selected 
subsets of book in the library and collecting information 
about past or present use of the books. Fussier and Rimon 
(1961) used this type of approach. 




67 



All previous investigators, with the exception of 



Jain (1967) who used his "relative use" measure previously 
described in Section 2,5, used either the "collection 
method" or the "check out" method for the collection of 
their use data. This investigator collected random samples 
of book demand using the "collection method." 

5.3 Problems and Difficulties in Collecting Demand Data 

Certain problems arise with the collection of all 
types of book demand data caused by the type of library, 
the type of records which are kept and by the method 
of collecting data. With regard to the method of collecting 
data, Jain (1967) examined the advantages and disadvantages 
of the "collection method" and "check out" method and 
shelved the following advantages and disadvantages of each : 



Collection 

Characteristics Sample 



Check-C”t 

Sample 



Can one draw 
inferences about 
the whole 
collection? 



jes 



no 




Collection Check-Out 
Characteristics Sample Sample 

Can information be 
obtained on the rate 
of usage of the same 
group of books over 
a long period of 

time? yes no 



Is it relatively 
easy to design a 
sampling scheme 
and collect data? 



no yes 



Are the problems of 
missing data and 
of lack of control 
on the methods 
of recording usage 
histories in the 

past avoided? no yes 



The collection method is the only way that inferences 
can be made about the entire collection or subsets of the 
collection. 

There are problems caused by the type of library and 
the method in which book usage is recorded. Theoretically 
in a closed stack library, all demand is known- If 
the library keeps the records of all demands, demand studies 
would be no problem. This is not the case. Records 
generally are not kept. Since the data collected for this 
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study were collected in an open stack library, the .fourth 
characteristic shown above will now be considered. Book 
use records are generally available only for hone— use 
demands. A book card or some record is generally affixed 
to the inside back cover of the book and a patron on 
checking out a book will sign his name or a date charged 
or date due is written or stamped into the book. 
Theoretically, this record of home use should provide for 
a good record of past home use of the book. However, the 
amount of space provided for names or dates is finite. 

The cards are generally thrown out when they are "filled 
up" with signatures or the "date due" slips are removed 
when they are "filled up" with dates. 

when looking at these usage records for long periods 
of usage, it is sometimes difficult to determine whether 
the first date stamped is the first home use of the book 
or is the first home use on a new date due slip. That is, 
a date due slip had already been filled up with dates, 
removed and discarded. Other problems sometimes arise such 
as attempting to determine the date at which a book was 
added to the collection. For example, a book with a date 
due slip inside its back cover with an imprint date of 
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1910 and with no record of home use on the date due stir* 
may be: (1) a book which was added to the collection in 

1910 with no home use since that time; (2) A book added 
in 1910 with a thousand uses until I960 at which tine a 
new date due slip was placed in the book; (3) A book which 
was added to the collection in 1968 at which time a date 
due slim was placed into the book, 

5,4 Identification of Books to be^ Included In Random 
Samples 



The books identified for the analyses performed 
during this investigation were identified using a random 
sampling scheme, A random sample of the entire State 
University of New York at Buffalo library collection was 
collected from the library's shelf list in order to studv 
the number of home uses of books during a period of time, 
and a random sample of Library of Congress P class (Language 
and Literature) books were identified to be studied to 
determine the number of demands for the books. 

The population sampled was the library's entire book 
collection. This was performed by randomly identifying 
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the hooks by using the library's shelf list, Ihe shelf 
list consists of armroximately 400 drapers (each 
approximately 16" in length) of 3x5 cards — each 
containing catalog card information about one title. The 
shelf list is arranged in call number order so that. , 
theoretically , the cards are arranged in the same manner 
as the books are on the shelves. 

in order to collect a random sarrmle from the shelf 
list, it was decided to randomly identify an eighth of an 
inch of the total thickness of cards comprising the shelf 

list, pull out a card from this eighth of an inch 
and record the appropriate identifying information. 

To select the random samples, the drawers were 
numbered, the depths of the drawers measured and a computer 
program written to generate as many random numbers as cards 
to be included in the sample. The program then converted 
the random number into a drawer number and a depth (in 
inches) into the drawer at which a card was to be pulled, 

A portion of the output of the program is shown in 
Table 5,1, The first line in the Table identifies a card 
in the 50th drawer at 4/8 of an inch into the drawer. 
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The 



TABLE 5.1 



A PORTION OP THE RANDOMLY 
GENERATED SHELF LIST 
DEPTHS FOR RANDOM 
TITLE IDENTIFICATION 



Drawer 

Number 


Inches 


50 


0 


4/8 


51 


1 


4/8 


51 


9 


3/8 


53 


5 


3/8 


53 


7 


7/8 


53 


14 


7/8 


54 


11 


1/8 


54 


12 


7/8 


55 


1 


5/8 


55 


2 


3/8 


55 


5 


7/8 


55 


12 


1/8 


55 


14 


1/8 
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second line identifies a book in drawer 31 at 1 4/8 inches 
into the drawer. 

Two hundred random shelf list depths were generated 
at a tine. For example, if 1000 random depths were desired, 
five sets of 200 random depths were generated. Each 200 
locations were randomly generated over the entire shelf 
list depth. 

This was done because shelf list drawers are, 
necessarily, not full and prior to selecting the desired 
cards, it was necessary to compress the cards toward the 
front of the drawer. In all drawers , then, the back portion 
of the drawer would contain no cards and the randomly 
generated location may be empty. Two hundred (200) random 
locations might only result in 78 cards or 114 cards, etc. 

Upon identification of a card, the call number was 
recorded as well as some other identifying information, 
such as number of copies, location (branch library, etc.) 
and a short title. This information was then used to 
retrieve the book for the book demand data to be used in 
the analyses. 

O 

ERIC 



74 

95 



5,5 



Samples of Book Demand and Uses 

Tnis investigator collected demand for known home 
use of books for a sample of 702 books from the library's 
entire collection for the period 9/1/66 - 8/31/67. In 
addition, all demands (known and unknown) were recorded 
for the P class (Language and Literature) books using a 
questionnaire method for the period March 3 # 1968 through 
May 11, 1968, During this period, 12,R9R demands were 
recorded for 10,422 different books. The P class shelf 
list was then sampled and 732 books randomly selected, for 
analysis , 



5,5,1 The Collection of Home-Use Book Demand Data For The 



SUNY Buffalo Collection 



Ten sets of 200 random depths were generated in order 
to identify call numbers of titles from the entire PUNY 
Buffalo shelf list. The procedure described in 5.4 was 
followed. Because of empty space in shelf list drawers, 
1167 titles were identified, A search was made in the 
stacks and; (1) if the book was found, the number of 
charges (home use demand) as recorded in the back of the 
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book was copied; (2) if the book was not on the shelf, 
the book was recalled if it were charged out to another 
patron and the number of charges as recorded in the back 
of the book was copied upon its return; (3) if the book 
was not on the shelf and not charged out, a search for the 
book was made one week later and the number of charges 
recorded if it were found. If it was not found upon a 
second trial, it was assumed to be a missing book. 



A tabulation of the number of books identified by 
Library are shown in Table 5,2, For example, 48 titles 
were searched for the Health Sciences Library, 44 were 
found, 3 were either not found or not returned by the 
borrower upon recall, 1 did not circulate. 



The 1020 books which were found were checked to see 
if they were charged out for home use during the period 
9/1/66 through 8/31/67, Because of the missing data or 
uncertain data problem described in Section 5.3, 318 books 
were eliminated from consideration. The investigator was 
sure that a book was on the shelf available to patrons if : 
(1) a charge date was recorded on the date due slip earlier 
than 9/1/66; (2) the book contained a University of Buffalo 
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stamp (this means it was cataloged prior to 1963) ; (3) an 

identifying book plate was in the book (book plates were 
not put in books later than 9/1/66) « The investigator was 
sure that the book was not on the shelf during 9/1/66 
through 8/31/67 if the imprint date was later than 1967 
(for example, 196 8), and was not sure if the book was on 
the shelf for the specified period if none', of the above 
described conditions were appropriate. 



A tabulation of the number of home use demands for 
the period by sample is shown in Table 5,3, For example, 
sample 6 showed that 52 books were not charged out during 
the period 9/1/66 through 8/31/67, 14 were charged out 
once, 7 were charged out twice and one book was charged 
out six times. 

5 , 6 Collection of Book Demand Data for Language 
and Literature (F Class ) Books 

The Language and Literature collection at the 
SUNY/Buffalo Libraries comprises the largest class of books 
within the library system (perhaps one quarter) of the 
450,000 title collection. In addition, the entire class 



TABLE 5.3 



HOME USE DEMANDS OP TEN RANDOM SAMPLES 
OF BOOKS DURING THE PERIOD 9/1/66 - 8/31/67 



Sample 

Number 0 



1 



Number of Home Use Demands 
2 3 4 5 6 



7 



Total 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 



41 


9 


40 


8 


54 


12 


61 


11 


59 


12 


52 


14 


55 


14 


53 


6 


54 


13 


61 


13 



2 0 0 

2 10 

6 12 

5 2 0 

4 0 2 

7 0 0 

5 0 0 

3 2 0 

3 2 0 

5 10 



0 0 0 
0 0 0 
0 0 0 
0 0 1 
0 0 0 
0 10 
0 2 0 
0 0 0 
1 0 0 
0 0 0 



52 

51 

75 
80 
77 
74 

76 
64 
73 
80 



Totals 530 



112 42 9 



4 13 1 



702 
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(with the exception of fiction books) are shelved on one 
level of Lockwood Library, The level has one stack 
entrance. There is an emergency exit which is not to be 
used by patrons. In addition, there are essentially no 
reader spaces on the level so that patrons wishing to use 
P Class books must physically remove the books from the 
stack level. Because of these reasons, the P Class books 
were selected for intensive demand study for this 
investigation. During the time of the data collection 
(Spring 1968), Lockwood Library was open 96 hours per week, 
(From 7:30 a,m, to 11:00 p,m. on Mondays through Fridays ; 
from 7:30 a.m, to 5:00 p,m. on Saturdays; and 1:00 p,m, 
to 11:00 p.rt. on Sundays), It was decided that the best 
way to get demand data (both known and unknown) was to ask 
patrons what they were looking for. In order to accomplish 
this, data collectors were stationed at the stack entrance. 
As each patron entered the stack entrance, they were given 
a questionnaire (See Figure 5,1 and Appendix A), The 
questionnaire was designed to find out what the patron 
looked for, what he found, and whether he would use what 
he found in the library or charge it out for home use. 

Data collection was begun on February 13, I960 and continued 
through May 11, 1968, Data was collected on each day except 



FIGURE 5.1 



QUESTIONNAIRE 

Date Tim e 

TH6 RESULTS OF THIS QUESTI ONNA1RI WILL HELP THE LIBRARY IMPROVE ITS 3ERV1CI* 
YOUR COOPERATION WILL BE APPRECIATED. 

For ftjch book which you looked for, found ©r used, list the corqplete call nymb#f on the lift 
and circle your answer to each a# the three questions on the right* Please circle the arwer 
to fb® question even if you did not find the book you wonted* 



Coll Nunijer Question Answer 

• —*— * - — ■ 1) Did you hove the coll number when YES 

you coma into Hie stacks? *,q 



2) IF you hod the Coll number, did you YES 

find the book you looked for? 



3) If you found (or would have found) the Lb® It In the Library 

book, will you (would yog)? Cliorg* it Out 

Not sure if I wlH(erould) 
chorge it out or not 



Coll Number 


Question 

1) Did you hove the coll number when 
you came into the stacks? 


Answer 

YES 

NO 






2} If you had the coll number, did you 
Find the book you looked for* 


YES 




NO 




3) If yog found (or would have found) the 


Use It in the Library 




book; will you (would you)? 


Chorge ft Out 

Not tore if 1 will (would) 
charge it out or not 


Call Number 


Question 


Answer 


- 


1) Did you hove the coll number when 
you came into the stacks? 


YES 

NO 




2) If you had the coll number, did you 
find the book you looked for? 


YES 

NO 




3) If you found (or would have Found) the 


Use It In the Library 




book, will you (would you)? 


Charge it Out 

Not sure if 1 will (would) 
charge it out or not 


Call Number 


Question 


Answer 




1) Did yog have the call number, when* .. 
you came into the stacks? 


YES 

NO 




2) If you had the call number, did your 


YES 




find the book you looked for? 


NO 




3) If you found (or would hove found) the 


Use it In the Library 




book, wifi you (would you)? 


Charge ft Out 



Not sura If I wfll(wauld) 
charge It out or'not 




March 26, 1968 (Easter Sunday) when the library was closed. 
The questionnaire was checked for a period of 18 days. At 
that time, it was decided to ask the patrons for their 
student number if they were a student or their name if they 
were a faculty member. It was discovered at that time that 
books which were not recorded on the questionnaires were 
being charged out. In addition, call numbers were being 
imporperly recorded and coded by the data collectors. 
Detailed instructions were prepared and given to the data 
collectors (Appendix B) . Each demand was recorded, coded 
and keypunched. The punched card format is shown in 
Appendix C. All data collected beyond the eighteen-day 
period were use'd for the analyses in this research. 



In addition to the demand data collected on the 
questionnaires at the stack level, the charge cards for 
P Class bocks were checked daily to see which P Class books 
were charged out. The names or numbers of the person 
charging the book were compared with the questionnaires to 
determine if any individuals were not completely filling out 
a questionnaire. The data collection procedure was completed 
on May 11, 1969 and produced 51 days worth of demand infor- 
mation. Seven thousand eight hundred ninety-three (7,893) 
questionnaires were distributed during the 51 -day period. 






P Class stack level is shown in Table 5,4. The table 
shows i the day of the study; the number of questionnaires 
distributed that day (this number includes a questionnaire 
for each patron who refused to fill out a questionnaire) ; 
the number of questionnaires not returned; the number that 
were useable (some questionnaires did not include P class 
books or contained unintelligible information and were 
thus not useable)'; the number of questionnaires which 
contained no P's; the number of questionnaires which were 
refused; the number of questionnaires for patrons who were 
browsing but who found nothing; the total number of demands 
for the day; the total number of recorded demands for the 
day; the total number of unrecorded demands (these were 
for books charged out but which were not recorded on 
questionnaires and are traceable either to those refusing 
to fill out a questionnaire or unintelligibly filling out 
a questionnaire) ; and the number of unrecorded traced 
demands (these are for books charged out which were 
traceable to particular questionnaires but which were not 
recorded on the questionnaires) • The number of unrecorded 
demands appears to be high, ranging from 12 % to 53? of the 
total demands (averaging 33?) . The total demands then are 
somewhat deflated and the number of demands would be 
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TABLE 5.4 (CONTINUED) 



FILMED from e 



best AVAIIABEE OjOFY 



l.* if e e» w 

=5 * * * * 

r> ” Q *T r e~ ^ 



^ n - lp vT P S 

j f r R If 't' 






****** 



****** 



ZD u 

n (/ 

i ? 

►= C, 



m its iP 



fu ru N (V IP 



i to 
Z? 
& Lt- 
<3 UJ 
* & 



W fS.- ir 



IP IT »P 



CM CM 



TO fU ru 



PM PU 

Lp p" 



*1 |P CM 



«J CP w 

ei » 3 
z a 
U D 



»-*. tr- * 
H ^ 
cat/ 



tv ip 



(V tVi PJ 



lO Uf\ CM UP 



u • 

_* ■= 

rr tr 



IV If* ip 



m ip 

•■ * 

lf> CM 



tv* CM CM 



fy to pj 



* u 
Cr y ! 

r s 

= e* 



a 

S' 



CM PJ 



d 

ERIC 



ip 

in 



increased if it were possible to get patrons to cooperate 
more with questionnaire surveys. On the other hand, the 
probability of a person having an unsatisfied or satisfied 
in-house demand given that he had a home use demand was 
calculated to be approximately .2, which is not high 
enough to affect the curve fitting analysis which will be 
performed in Section 6*2 especially since there is no reason 
to believe that unrecorded demands occurred in any systematic 
manner. It is reasonable to assume that the unrecorded 
demands occurred in a random manner. This means that although 
the mean number of demands and actual frequencies of demands 
may be understated, the analysis performed should be reasonably 
valid. A summary of the totals for the 51— day period are 
shown in Table 5.5. 

5.6.1 Records of P Class Demand Data 



After keypunching and verification, the demand data 
was entered onto magnetic tape and maintained in day-time 
and call number order, A page listing *Ton this file is 
shown in Figure 5,2, The punched card format and master 
file tape record format are shown in Appendix C, 



TABLE 5., 5 



QUESTIONNAIRE SUMMARY OF P CLASS DATA 
Questionnaire 



Status Number 



Not Returned 805 
Useable 3,613 
Without P's 2,029 
Not Useable 463 
Refused 636 
Browsing 293 



Total 7,893 



Demand 



Status 


Number 


Recorded 


7,759 


Unrecorded 


4,378 


Traced 


761 


Total 


12, 898 





P ercent 

10,27 

46.08 

25,88 

5.91 

8.11 

3.75 

100.00 



Percent 

60.16 

33,94 

5.9 0 

100.00 
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SAMPLE PAGE FROM MASTER FILE LISTING 
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5,6,2 Identifying a P Class Random Sample for Analysis 

The data collected represents demands for books 
demanded. Ten thousand four— hundred twenty (10,420) 
different books were demanded during the 51— day period. 

The collection comprises over 125,000 titles. In order 
to analyze the demands for books, a random sample of titles 
was selected from the P Class shelf list using the shelf 
list sampling procedure described in Section 5,4. The 
random shelf list locations were generated in samples of 
200 as described in Section 5 , 4 . Five random samples were 
collected. After recording the call numbers from the shelf 
list, the call numbers were checked against the call 
number ordered master file to see if the books were demanded 
during the period of the study. A summary of the number 
of demands for each sample is shown in Table 5.6, 
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TABLE 5.6 



SUMMARY OF RANDOM SAMPLE OF P CLASS BOOK DEMANDS 



Sample 

Number 



0 



Number of Demands 
1 2 3 



4 Total 



131 


10 


3 


137 


16 


0 


122 


13 


4 


125 


16 


2 


135 


10 


4 



0 0 144 

0 6 15 3 

2 0 141 

0 0 143 

0 2 151 



Totals 650 



65 13 2 



2 



732 




111 
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CHAPTER 6 

ANALYSIS OF FREQUENCIES OF KNOWN AND 
UNKNOWN DEMAND FOR BOOKS 
The analyses performed in this chapter use the 
collected random sample of known demand (described in 
Section 5.5.1) and the random sample of total demand for 
P class books (described in Section 5.5.2) of the collection 
of the State University of New York at Buffalo Libraries . 

6.1 The Known D emand Sample 

The known demand sample consists of 702 books randomly 
identified in the SUNY/Buf f alo shelf list. The frequency 
distribution of the number of recorded charges are shown 
in Table 6.1, The mean number of known demands is .36319. 
The variance is .71886. We hypothesize that the number 
of known demands for books in the sample can be described 
by the negative binomial distribution. Using the method 
of maKimum likelihood,, the parameter, r, of the distribution 
was found to be ,47066. P was estimated using the ratio 
of the mean to the variance and was found to be .53305. 
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TABLE 6.1 

FREQUENCY DISTRIBUTION OF KNOWN DEMANDS FOB 
A SAMPLE OF BOOKS FROM THE 5UNY-BUFFA.LO COT, LECTION DURING 
THE PERIOD SEPTEMBER 1, 1966 - August 31 , 1967 



No, of 

Known 

Demands 




0 

1 

2 

3 

4 

5 

6 
7 



530 

112 

M2 

9 

4 

1 

3 

1 



Total = 702 





6.1.1 The Chi Squared Goodness of Fit Test 

The negative binomial distribution with parameter - 
p = .53305 and r — .47066 was fit to the known demand data 
sample. The Chi Squared table is shown in Table 6,2. The 
Chi Squared column shows the individual components of Chi 
Squared, The calculated Chi Squared is 6.7451. The P value 
(the probability of getting a value of Chi Squared as high 
or higher than the observed) determined from a Chi Squared 
table was found to be .24. Since the 7 demand cell had a 
theoretical frequency of less than 1 , the frequencies of the 
6 and 7 cells were combined into cell 6. The theoretical 
cell frequencies are less than five in two of the cells. 
Grouping the frequencies of the last two cells would have 
resulted in a reduced calculated value of Chi Squared of 
3.6526, At 2 degrees of freedom, the P value would be ,17. 
Doth are reasonably high values of P, A complete Chi Squared 
Table is shown in Appendix D. The results of the test 
indicate that the known demands for the sample of books are 
negative binomially distributed. 



6.1.2 The Variance Test 

The variance test was run in order to test the 
hypothesis that the variance was equal to the mean. The 
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TABLE 6 . 2 



GUI SQUARED GOODNESS OF FIT TABLE FOR A NEGATIVE 
BINOMIAL FIT TO THE FREQUENCIES OF KNOWN DEMAND 
FOR A SAMPLE OF ROOKS FROM TEE PUNY- BUFFALO 
LIBRARY COLLECTION DURING TITE PERIOD 
SEPTEMBER 1, 1966 - August 31, 1967 



Observed 



Theoretical 



Value Frequency 



P robability 




Chi 



0 

1 

2 

3 

4 

tf 

6 



530 

112 

42 

9 

4 

1 

4 



*74370 


522.077 


, 16345 


114.742 


.05612 


39. 396 


.02158 


15.149 


. 00874 


6.135 


. 00365 


2,562 


. 00276 


1,936 



Total Chi Squared = 6 
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Squared 

1203 
0655 
1721 
4 9 59 
7433 
9526 
19 54 

7451 



alternative hypothesis was that the variance was greater 
than the mean, thus, 

H 0 2 Variance equals the mean 

A Q : Variance is greater than the mean 

The table of computations for the variance test is shown 
in Table 6.3. A complete Chi Squared calculation table 
is shown in Appendix E. The calculated value of Chi Squared 
is 1322.638, whereas the theoretical Chi Squared calculated 
using the normal approximation to Chi Squared at a 
significance level and 701 degrees of freedom is 7 6<?. We 
would thus not accept the hypothesis that the variance is 
equal to the mean and would conclude that the variance is 
greater than the mean. 

6,2 Sample of 3? Class Book Demand s 

A sample of P Class books randomly identified from 
the P class shelf list was matched with the magnetic tape 
master file in order to determine the number of times that 
the 732 books were demanded during the period March 3, 

1968 through May 11, 1968. The distribution of the number 

95 
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TABLE 6.3 



VARIANCE TEST CABLE FOR K1TOWH DEMAND 
SAMPLE OF BOOKS DEMANDED FROM 
SEPTEMBER 1, 1 96 6 - AUGUST 31, 1967 



Value 



Frequency 



Chi Squared 



0 530 

1 112 

2 42 



201.930 

112.635 

288.947 



3 

4 

5 



9 

4 

1 



162.027 

137.503 

55.998 



6 

7 



3 248.609 

1 114,990 



Total Chi Squared =1322,638 
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of demands is shown .in Table 6.4, The mean of the 
distribution is .14344 and the variance is .20785. b7e 
again hypothesize that the demand is negative binomially 
distributed and, using the method of maximum likelihood, 
estimate the parameter r to be .41276. The parameter p 
was estimated, using the ratio of the mean to the variance, 
to be ,69011. 



6.2,1 The Chi Squared Goodness of Fit Test 



The negative binomial distribution with parameters 
r ~ .41276 and p * ,69011 was fit to the P class demand 
data. The Chi Squared table is shown in Table 6.5. No 
grouping was done in the tail of the distribution. The 
calculated value of Chi Squared was 6.2817, The P value 
was determined from a Chi Squared table at 2 degrees of 
freedom to be .048. Grouping the frequency for cell 4 
with cell 3 would have given a P value of .921. The data 
is not exceptionally well fitted by a negative binomial 
distribution. 



TABLE 6.4 



FREQUENCY DISTRIBUTION OF DEMANDS 
FOR P CLASS ROOKS DURING TITE 
PERIOD riARCII 3 , 19 SB - MA.Y 11, 196 ? 



{ 

\ 



No. of 
Demands 



Frequency 



0 

1 

2 

3 

4 



*50 

65 

13 

2 

2 



Total = 732 
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TABLE 6.5 



CIII SQUARED GOODNESS OF FIT TABLE FOR A NEGATIVE 
BINOMIAL FIT TO THE FREQUENCIES OF DEMAND . FOR TIIE 
SAMPLE OF P CLASS BOOKS DEMANDED DURING 
MARCH 3, 1968 - MAY 11, 1968 



Observed 

Value Frequency 



0 

1 

o 



650 

65 

13 

2 

2 



Theoretical 

Probability Frequency 



. 85805 
. 10975 
. 02402 
. 00599 
. 00219 



628.093 
80.337 
17.583 
4 .385 
1.603 



Chi Squared 

.7641 
2.9380 
1 .1944 
1.3969 
. 0983 



Total Chi Squared — 



6.2817 



6,2.2 The Variance Test 



The variance test was again run in order to test the 
hypothesis that the variance is equal to the mean. The 
results of the variance test are shown in Table 6.6, The 
calculated Chi Squared is 1062.508. At a 5^ level of 
significance and 731 degrees of freedom, the theoretical 
Chi Squared was calculated to be 795. The normal 
approximation to Chi Squared was used to calculate the 
theoretical Chi Squared. Thus, we would again not accept 
the hypothesis that the variance is equal to the mean and 
conclude that the variance is greater than the mean. 

6 , 3 Analysis of the Distributions of Known and Unknown 
Demand 



On the basis of the variance test alone, we would 
conclude that the variance is greater than the mean which 
provides suspicions that the delta is negative binomially 
distributed. The results of the Chi Squared goodness of 
fit test provides reasonably strong evidence that the 
known demands (Section 6.1) are negative binomially 
distributed, but weak evidence that the total demand (known 
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TABLE 6.6 



VARIANCE TEST TABLE FOR DEMANDS FOR 
P CLASS SAMPLE OF BOOKS DEMANDED FROM 
MARCH 3 , 1 96 B - MAY 11, I960 



Value 



Frequency 



Chi Squared 



0 



650 



1 

2 



65 

13 

2 



/} 



2 



92.950 
333. B40 
313.49 
114*16 
20 8 . 6 . > 



Total Chi Squared = 1062. 50B 
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and unknown) for P class books (Section 6,2) are negative 
binomial ly distributed. We are hampered in the latter 
goodness of fit test with few cells and the fact that we 
lose 3 degrees of freedom because of estimating two 
parameters of the distribution. This is typical of book 
use studies since exceptionally large samples are required 
before a sufficient number of values are found in the tail 
of the distribution to reach sound statistical conclusions. 
The fact that we accept the alternative hypothesis that the 
variance is greater than the mean and have evidence from the 
f Chi Squared tests., the demands for both groups of books 

are negative binomialiy distributed. 



6,4 Implications of Negative Binomialiy Distributed 
Demands 



In deriving the negative binomial distribution in 
Section 3.1, it was shown in equation [ 3 . 4 ] that if the 
probability of 0, 1, 2, .... events follow a Poisson 



distribution with parameter A t 



£ (x) 




Q » 1 1 
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and the mean number, X., of events vary in such a wav that 
they can be described by the Gamma probability distribution 
with density function [3,5]: 






= c 



r<» 



~c y> « — -I 
JL Tv- 



Then, the joint density of x and X is the product of the 
above equations, and leads to a negative binomial 
distribution of the variable x. Thus, concluding here 
that the demands for groups of books is negative binomial ly 
(as was done in the samples analyzed in this section) , 
permits us to at least assume that the number of demands for 
individual books varies according to a Poisson distribution 
and that the demand rates vary from book to book according 
to a Gamma distribution. 



* 
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CHAPTER 7 



DAILY DEI'iAND FOR P CLASS BOOKS 



Using the master file of demand data for P class 
books, ten random samples of 100 demanded books were 
collected. The daily demand for each of the samples was 
tabulated and analysed to determine if the number of demands 
per day for each of the samples was Poisson distributed. 

The Chi Squared goodness of fit and the variance tests 
were used to determine if the number of demands per day. 



X, was distributed according to 




-JL> UA. 



X! 



where s 

JX — the mean daily demand 
for the sample 

X = the number of demands 




104 



-j‘- v -.J u J 



7 • 1 



Randomly Selecting Books to be Included in the Sanpl 

The P class books demanded during the 51— day period 
from March 3, 1968 through May 11* 1968 were numbered from 

1 to 14,222, Tan sets of 100 random numbers were generated 
within this range identifying those books which would be 
included in each of the samples. The number of daily 
demands was tabulated for each of the books in each of the 
samples. The demand frequencies for each of the samples 
are shorn in Table 7.1 together with the mean demand and 
variance of each of the samples of 100 books. 



In general, the demand distribution did not exhibit 
very long tails except for sample 3 in which there were 
13 demands on one day. The means and variances were 
calculated using the actual data and not the data grouped 
into the greater than or equal to seven category. 



<y 

o 

c; 


VP 




^3- 


co 


co 




LTi 


IT, 


c- 


CJ 


ro 


ro 


c:r 




r-- 


vp 


CO 


Cv 


CJ 


r- 


c. 


*H 


• 


* 


• 


• 


• 


• 


• 


• 


• 


• 


u 


cj 


LTi 


co 


cj 


co 


cn 


^d- 


•=J- 


CJ 


c< 


fd 
**- , 























a 










CM 










Q 










fr — * 

H « 


Ci ci 


T“ 


r- 




< 


fdl vp 


1/'; 


C\ 


O 


£ CO' 


cdI * 


« 


• 


o 


CM VL 


£' 1 CJ 


*— 


Cs! 




CM ff 1 . 









S ui 

A. ^ 
Hi 

s c 
Pm c 
o; 

Pm cc 

o 

< 
LI H 3 
04 CJ- 



a 

f— i CD 
CM ^ 
£4 *H 
fd C/. 

l : 



c- 


HH 


CO 




Pm 


o 

Hr= 

f t 


Oh VP 
Q0 


! cj 


h5 

03 


c; 

Cj 

C’ 


a t*‘“ 
04 

04 - 


VPl ^ 

rn 


Eh 


2 


Cj co 



Oh 5 £4 

k c 

cm CM 

§ s ^ 

cm o 

a 



g 



Em 

c 

cm 

8-5 

Ph 



^4 

IH 

CM 

•D 

a 



to 



to to in it; lo in t=n m in ld 



OJ o 



VO I ^(NfMf^r'Cr'C! n(V 5 

*3 

Cl in I n m sr r- vd r: r' n 

fd 

£ 

<y 

P ^1 vev£r^vD^iftif)Mr»i 

>i 

-H 
fd 

Q Co? 



iH 

O 

SH , 
a; col 



CO G i VP5 ©D r- O 



cn o 



C n C CO r« ro n n 



w 



0 



^ ^o^r^ooi r- e\ 



cj m ^ h- co od ci ^ 



<u u 

r~i 4 > 

t*s 

fd P 

to 0 : 



Nn^in^h-eociO 






O 

ERIC 



106 



1 



: 0 ne book had 1 3 demand 



7.2 



The Chi Squared Goodness of Fit Test 

A Chi Squared goodness of fit test was run on each 
of the samples. Because of the low theoretical cell 
frequencies in the tails of the distribution (less than 
one) # it was necessary to group the data in all hut the 
9th sample. These data were grouped into the "greater 
than or equal to six" cell except for sample 3 in which 
the data was grouped into the "greater than or equal to 
seven" cell. Table 7.2 shows the sample number * calculated 
value of Chi Squared (Chi Squared Observed) and P values 
for each of the calculated Chi Squared. The P value is 
the probability of getting s. value of Chi Squared as high 
or higher than the observed if the hypothesis is true. 

A tabulation of the Chi Squared goodness of fit tests for 
all samples can be found in Appendix E* 

7.3 The Variance Test 

The variance test was run on each of the samples in 
order to determine whether there is sufficient evidence 
to reject the hypothesis that the variance is equal 
to the meap. The alternative hypothesis is that the 
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TABLE 7 , 2 



SUMMARY OP THE CHI SQUARED GOODNESS OF FIT TP 
FOR A POISSON FIT TO THE FREQUENCIES OF DAILY t 
FOR SAMPLES OF BOOKS DEMANDED DURING THE 
PERIOD MARCH 3, 196 8 - MAY 11, 196 9 



Sample Observed 

1 lumber Chi Squared 



Degrees 
of Freedom 



P 

Value 



3 
4 , 



16 
16 
5 . 12 
3 6 29 
8.74 
6.4 8 
1 . 36 



5 

5 

5 



,69 
, 5 3 
,41 
,66 
13 
27 
93 



13.71 

1.59 

5.05 



5 .02 

5 .90 

5 . 33 
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variance is greater than the mean . Concluding that the 
variance is equal to the mean is important since we are# 
in effect# testing to see if the daily demand is Pcisson 
distributed. The values in the tails of the distributions 
were not grouped for this test. A summary of the sample 
number# means# variances# calculated Chi Squared (Chi 
Squared Observed) # degrees of freedom and F values are 
summarised in Table 7.3. The results of the complete tests 
are shown in Appendix I. The P value is, as before, the 
probability of getting a value of Chi Squared as high or 
f ' higher than observed. 



7.4 Analysis of Daily Demand 



Wine of the ten Chi Squared goodness of fit tests 
would tend to confirm the hypothesis of Poisson distributed 



daily demand since they showed reasonably high P values. 

Sample eight has a low P value. The variance test, on the 
other hand, results in the acceptance of the variance equals 
mean hypothesis at the 5% level of significance in only six 
of the ten samples. Samples 2, 3, 5 and 8 would tend to be not 



accepted as having their variances equal to the mean, A 
closer look at the results of both the Chi Squared goodness 
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TABLE 7 , 3 

SUMMARY OF THE RESULTS OF THE VARIANCE TEST 
RUN ON SAMPLES OF DAILY DEMAND FREQUENCIES 
OF P CLASS HOOKS DEMANDED DURING THE PERIOD 
MARCH 3, 1963 - MAY 11, 1963 

Sample Chi Squared Degrees of P 



Number 


Me an 


Variance 


Observed 


Freedom 


Value 


1 


2.63 


2.36 


44.80 


50 


.677 


2 


2,51 


5,01 


74.41 


50 


.015 


3 


2.72 


3.14 


88,20 


50 


. 005 


4 


2.24 


2.78 


62,26 


50 


.119 


5 


2.4 3 


4.17 


85.76 


50 


.005 


6 


2 .33 


3.63 


62.29 


50 


.118 


7 


2.31 


3.34 


61,80 


50 


, 128 


8 


2.43 


4.25 


72.60 


50 


.071 


9 


2.41 


2.77 


57.37 


50 


,124 


1 0 


2.37 


2,92 


61.50 


5 0 


.134 
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of fit test and the results of the variance test shown in 
Appendices II and I reveals some interesting facts. Sample 2, 
which had a good P value on the goodness of fit test and 
has a low P value on the variance test had frequencies of 
7 and 9 pulled into the greater than or equal to siv cell. 
Looking at the variance test table shows that the bulk of the 
Chi Squared is caused by the value of 9 in the table. 
Eliminating this value would lower the total Ch c Squared which 
would give a P value closer to .12 which would be accepted 
at the 55 level. Sample 3 has similar characteristics. 
However, here the distribution would have a better fit if the 
one value of 13 were eliminated. Samples 5 and R exhibit poor 
values of Chi Squared for most of the cells in the goodness 
of fit and variance test analyses. A check on confidence 
limits for daily moan demand was made and mean demand was out 
of control on days 51, 50 and 66 and almost out of control on 
day 52, This check was made using the first five samples 
(500 books) and calculating a mean demand per day of .254. 

The standard deviation of the daily mean demands was 
calculated to be . 1058. Using 9 5 5 confidence limits for the 
daily moan demands with the normal distribution, the limits 
were .0467 and .4613. No days were outside the lower 
confidence limit. Day 51 showed a mean demand of .520, 



day 5 2 showed .440, day 60 showed .480; and day 66 showed 
.520, Interestingly, the problems with the variance test 
in samples 2 and 3 occurred with frequencies for both of 
those days. We could conclude that something peculiar 
occurred on those days to cause a shift iri the mean demand 
for books. One could possibly conclude that the mean demand 
varies on a daily basis. If this is so, the daily demand 
data %vould fit a negative binomial distribution if the 
demand rates for the group of books varied from day to day 
according to a gamma distribution. A further point could be 
made here concerning a possible shift in the mean demand 
late in the semester. All four of the high mean demand days 
occurred in the last 20 days of the 51 -day collection procedure. 
Although analysis of the demand data, summary appeared to 
indicate a slight shift in the number of demands late in the 
semester, no attempt was made in this study to determine 
where the shift occurred. Also, there appeared to be no 
significant day to day variation in mean demand although 
the point was not formally pursued. The data collected 
in the ten random samples, does appear to support, although 
not exceptionally strongly, the Poisson distributed demand 
hypothesis. This evidence, again, does not support the 
hypothesis that the demand for individual books is Poisson 
distributed. It does support the hypothesis that daily 



d 

j 



demand is Poisson distributed* Not rejecting this 
hypothesis 9 hcv/ever , does provide evidence that the demand 
for individual hooks is Poisson distributed. This point 
will be pursued in Section 7.5. 

7,5 Significance of the Distribution of Daily Demands 

It is assumed here that the demand , * * . , , for 
individual books are independently distributed with 
parameters ( JLL^ . Then* 

n. 

* “ ' %c 



has the Poisson distribution with parameter. 






i-l 




Consider the case for n = 2. The variables x ( , x ^ takes 
the values 0, 1, 2, We find the probability that 

their sum (x, + x t ) is equal to r. The probability that 



x, has the value s and x^ the value (r-s) is: 

p[ s} P[x t * ^ yU. 



(r- s) 
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given 



The probability that x = r is given by sunning the above 
equation over all values of s (0 to r) : 



r S (r- S) 

Jb JL 




3 -o 



S 1 Cr-S>) 



l 



CAX. S C r ~ S) 

SI Cr-S)! 

S = o 



[7,1] 



the binomial expansion of (/ X , + /X i.) is* 



S (r-S) 
M t p. t 



s ,o 




r 




S’O 



S Cr-s} 

-/<*■. JJ-z 



SI (V'S) - 



the summation in [7.1] is 

z, )' 



and # 



p{z= r} 



- Q.-r/Q* 

r 



which is a Poisson frequency distribution function with 
parameter ( j± % + - ^his can be extended to n values. 
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Thus, when we tent the daily demand for a sample of 
boohs (not necessarily random), we are, in addition to 
showing that the demand for the whole sample is Poisson 
distributed, providing evidence that the demands for the 
individual books comprising the sample are Poisson 
distributed. It could happen that seme arbitrary 
distributed demands for books could combine in such a way 
that the demand for the whole sample is Poisson, but it 
is unlikely. 
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CHAPTER 8 



BOOK USE IN SEVERAL INSTITUTIONS 
As part of their study, "patterns In The Use Of Books 
In Large Research Libraries," Fussier and Simon collected 
data and made comparisons of book— use in several 
institutions. They used their collected data to attempt 
to find out if their findings could be generalized to other 
libraries, 

8.1 The Fussier and Simon Data 



The Fussier and Simon data collection procedure 
involved deriving lists of books that were held at the 
University of Chicago library and other university 
libraries. They compared sample lists in biology, Teutonic 
languages and literature, and philosophy against the 
holdings of the Yale University library; the lists of 
physics, Teutonic languages and literature, and economics 
against the holdings of the Northwestern University library 
and the lists of economics, Teutonic languages and , 

literature, and biology with the lists of the University 
of California at Berkeley library. The original University 
of Chicago lists were developed by a random systematic 
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sampling scheme for selecting monographs for the use studies 
in the University of Chicago library. For each comparison 
sample, the titles on the original list that were held by 
both institutions formed the new sample, Each comparison- 
sample formed a group of similar items that were available 
to patrons in different institutions. Fussier and Simon 
selected the particular libraries so as to include one 
much larger library, one smaller library, and one about 
the same size as the University off Chicago library. They 
then attempted to answer the question, "Are titles that 
are used little or much in one institution used little or 
much in the other?" They examined each use-groun of titles 
(identified by use in 1949-1953) in each comparison-sample 
to see how the groups behaved in 1954-1958 in the two 
institutions in each pair. For example, the data presented 
for the Chieago-Yale comparison for Philosophy are shown 
in Table 8,1„ Of the titles that had no use at Yale in 
1949-1953, 58 were not used at Chicago in 1954-1950? 70 
were not used, at Yale during the same period. 
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TOTALS 89 85 21 12 3 4 7 150 105 



Fitting the Negative Binomial Distribution 



8,2 



Although the Fussier and Simon data are used in this 
research to perform an analysis to determine if book use 
for samples of books at other institutions is negative 
binomially distributed. Fussier and Simon collected and 
used the data for a much different purpose. For the 
research reported here, the frequency of book use during 
the period 1954—1958 in the specified library and subject 
area were extracted from the Fussier and Simon data* The 
data are presented in Table 8,2, 

The mean use for each sample was calculated by using 
the total number of uses for the books in the sample and 
the total number of books in the sample. This results in 
a different mean for each sample than that which would be 
calculated using the frequency data, since the number of 
uses of frequently used books are necessarily grouped into 
the 5+ use category. Using the mean calculated in this 
latter manner would , in general, result in an inaccurate 
fit of the negative binomial distribution to the data. 

The parameter, r, was estimated for each of the samples 
using the method of maximum likelihood. The estimated 
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liU 



Economics Chicago northwestern 55 16 7 1 0 10 B9 2.20225 

northwestern 52 20 6 2 2 4 89 .88764 

Chicago California 69 16 5 2 0 2 94 .52128 

California 57 17 8 3 3 6 94 1.03191 



value of p, was therefore determined by using the 
relationship, p =* r/ (m+r) . After estimating the parameters 
from the sample data, the negative binomial distribution 
was fit to the data using Pearson's Chi Squared Goodness 
of Fit test, A summary of the sample size (N) , sample 
mean (m) , frequency of use, values of the parameters r and 
p, calculated values of Chi Squared, degrees of freedom 
(df) and P values are displayed in Table 8,3, The P value 
is the probability of getting a value of the Chi Squared 
statistic as high or higher than the observed Chi Squared, 
The original data presented by Fussier and Simon and the 
results of the Goodness of Fit test on each sample are 
shown in Appendix J. An analysis is given in Section 8.4. 

8,3 Variance Test 



A variance test was run on each of the samples using 
the following hypotheses? 



Hypothesis : 


cr 2 


=>x_ 


Alternative ; 


cr z 


7 



For each of the samples a Chi Squared statistic was 
calculated using; 
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Where ; 



(_ = the number of uses 

= the frequency of occurence of the 
ith use 

7>i •= the mean of the sample 



A theoretical Chi Squared was calculated using the normal 



approximation for Chi Squared : 



X 



th&orei Leal 




>/2t< - 1 ' ) E 



Where : 



= value of a standard normal 

deviate at a significance level 
of 1-p 

k = degrees of freedom = N ~ 1 

The normal approximation was used since there are 
greater than 60 degrees of freedom. For all samples the 
value of Chi Squared was calculated at a significance level 
of 5%, A summary of the results of the test is displayed 
in Table 8,4, it should be pointed out that the variance 
test tables are shown in Appendix K, The Chi Squared 
observed will be understated in all samples because Fussier 
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llote: DBA means Do Hot Accept 
DNR means Do Hot Reject 



and Simon grouped frequencies with greater than 5 uses 
into their 5+ use category, 

8 , 4 Analysis 



Although not specifically for their comparisons of 
book'-'use in several instituions , but elsewhere in their 
study. Pussier and Simon make the point that the 
distribution of books by the frequency of their use is not 
Poisson distributed. Fussier and Simon devoted their 
Appendix H to this premise and state: 



"The assumption of a stochastic model does 
not suggest that the Poisson distribution 
will approximate the distribution of books 
within a library (or within a given subject 
area) by the frequency of their use during 
some period of time. If a library 
contained 10 books, each with an expectation 
of 1,000 uses per year and 100 books with 
an expectation of .01 uses per year, the 
expected distribution of a sample drawn 
from such a library, instead of being 
Poisson will show a scatter of books 
around 1,000 uses, and a scatter of books 
at: zero use and slightly above, with 
practically nothing between the scatters. 

We make this point because the fast- 
falling, convex- to- the-origin curve 
stemming from a binomial process may 
immediately suggest the Poisson to many 
readers. 
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And, in fact, the observed distributions 
do not resemble the Poisson closely, 
having a much higher variance. There are 
too many observations at "zero" and at 
multiple use points. .. . » " 

Fussier and Simon then demonstrate this point by 
showing the distribution of the number of uses for 
monographs in economics and Teutonic languages and 
literature samples with the points of a Poisson distribution 
superimposed on the actual distribution. 

Indeed Fussier and Simon are correct in their 
conclusions that the frequencies of number of uses of books 
in collections are not Poisson distributed. Their example 
using the 10 high use books and 100 low use books is, 
however, somewhat ambiguous. Their point is valid. Such 
use would not lead to a Poisson distribution just as 
normally distributed frequency of use or some other 
arbitrarily distributed frequency of use would probably 
not fit. It is the intention here to sho\tf that the Fussier 
and Simon data do fit the negative binomial distribution 
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reasonably well, 
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In all cases, the hypothesis that the variance 
is equal to the mean was not accepted at the 5 % level of 
significance. This is an important point since negative 
binomially distributed random variables necessarily have 
a variance higher than the mean, whereas the variance 
would be equal to the mean for the Poisson distribution 
and less than the mean for the binomial distribution. The 
variance test is frequently used when there are too few 
values to use a Chi Squared Goodness of Fit test. Cochran 
(1954) makes the point that he has frequently found the 
variance test to be significant when the Goodness of Fit 
test was not. He indicates that there is a striking 
difference in the power of the variance test over the 
Goodness of Fit test,. Concluding that the variance is not 
equal to the mean here does not in itself lead one to the 
conclusion that the data are negative binomially distributed. 
It does lead one to conclude that the data are not Poisson 
distributed and provides evidence that the sample data are 
negative binomially distributed since the alternative 
hypothesis is that the variance is greater than the mean. 
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The results of the eighteen goodness of fit tests 
indicate that the sample data fits the negative binomial 
distribution in 13 of the 18 samples. The samples which 
do not fit well are the four samples in the economics 
subject area at the Chicago, Nortwestern and University 
of California at Berkeley libraries, and the Teutonic 
language sample collected at the University of Chicago 
library in order to compare use at that library with the 
use at the University of California at Berkeley library. 

One could probably conclude by looking at the results of 
these Goodness of Fit tests that perhaps there is something 
peculiar about the usage of economics books which causes 
the poor fit of economics book usage data with a negative 
binomial distribution and thus would lead one to conclude 
that number of uses of individual economics books in 
libraries are not Poisson distributed. 

A closer look at the economics samples reveals that 
the ratio of the number of books with zero use to the 
number of books with greater than five uses is quite high. 

As a matter of fact, the ratios for the economies samples 
at Chicago when compared with California, and the Northwestern 
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samples have the highest ratios of all the samples. The 
economics sample at Chicago (when compared with Northwestern) 
does not. However# of the 89 books in the sample, 71 were 
not used or used once# eight books had 17 uses and 10 books 
had 173 uses. This is indeed a most peculiar distribution. 

The fit of all the distributions would have been more accurate 
if it were possible to analyze the data in the tail of the 
distribution. One could only conclude that there is something 
which causes the uses of these samples of economics books to 
be not negative binomially distributed. The other samples 
do, however, support the hypothesis of negative binomially 
distributed uses of samples of books both with the variance 
test and the Chi Squared Goodness of Pit test. Thus# the 
hypothesis of Poisson distributed uses of individual books 
tends to be supported by the Fussier and Himon data. 
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CHAPTER 9 

THE PREDICTION OF DEMAND FOR BOOKS 
The models developed in this chapter with the 
exception of that in Section 9,7 , are models for a book 
which has only one copy available. Models for multiple 
copy books have not been considered in this investigation, 

9,1 A Two Stage Book Use l lode 1 



Consider a book which has a demand rate equal to A 
per year and which is permitted to be charged out of the 
library for a period of time 1/ja, where jx is the permissible 
loan rate per year for the book, l/>u is called the loan 
interval. Suppose that we say that a book is in state 0 
when it is on the shelf and available for use, and is in 
state 1 when the book is charged out. Assume that only 
the two states r.re permissible , specifically, the book is 
in state 0 if it is not charged out and state 1 if it is 
charged out. Given that the book is in state 0 at time 
t there is a probability A 4 t +0 (At) of a transition to 
state 1 during the interval (t,t+ At) , independently of 
all occurrences before t. If the book is in state 1 at 
time tf there is a probability /la t+0(At) that there will 



be a transition to state 0 during the interval (t, t + .at) , 
independently of all occurrences before t. We assume that 
the probability of more than one transition during the 
interval (t, t + £t) is negligible and designate these 
probabilities above as OAt, An alternate specification 
of this process is in terms of two sequences of mutually 
independent random variables (T } and (T t , ,....) 

exponentially distributed with parameters equal respectively 
to X and/ . If the process starts in state 0 there is a 
transition to state 1 at time Tj * a transition to state 
0 after a further time* Tj . This two stage book use process 
is shown in Figure 9.1. Let p t (t) and p e (t) be the 
probability distributions at time t and let Pj (0) and 
p 6 (0) » 1 - p (0) be specified. 

Using the above probabilities- it follows thati 
j 0 (t vAt) = ” (XA-fc* ©(**.30 

Then letting 0 (At) — >0 
p e (t-»60^ + feCt) - 
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FIGURE 9 , 1 



A TWO STAGE BOOK USE PROCESS 



State 1 

(Book Unavailable) 



State 0 

(Book Available) 
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And 



p o dt^o- 7 ^o ^ - ?0 Ct)XAt t 

p, d t + A.t'i - p, CO ^ p e (t)X4t - p ( CO^.At 



Dividing by At 
p e Ct=*AO -p e CO 

a>"t 



6fc) + _^-p t £0 



fl g t Ati-p,fo x to -^j W 

r ° 



Letting At — ► 0 







CO 



= Xp 0 CO - >^?1 



co 



oCt 



We know 



?i 



CO = 1 - Fc/° 



Therefore for [9,1] 

- - 7lT e C0 4^[l'P^ t ^] 

d± 

T ‘<tV= - a/^O+A’/Ap/ 0 





[9.1] 



[9,2] 
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Rearranging 

CO - - < 



Taking La Place transforms 

!{?/«} - ul - s ^ 

”(, {?»' a:i } = SuCsi--p.(O) 

<£ { > } " s’ 

Then 

sues) - p 6 CO) + ( *+AOuCs> - = 0 

3 

ucso [s + + 



u Ct>) 



M. yjo) 

+ - .. , 

s [ s +&♦/*)] s + O.+ju.) 



[9.3] 



By partial fractions 



s[ S + 



Ai 

s 



A 



2, 



S+ 




jX_ - A x Cste?.t>u)j4 A 2 S 
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x. 



When s = 0 



JX 



^ A 1 C X + 



A 



JX 



X+JX 



When s « - {X+jX) 



jx * - C’X 4r Jx')X\ 



r 

K 



A. 



jx 



A •M' 1 - 



Then substituting into [9.3] 



u(s) s 









/o) 



c Xt>o«s (a+^a) C5 + CX-t^)3 Ls + C^tM^ 



Taking Inverse La Place transforms 



CO 



jx / \ jx -C^jxH 

+ - Ju 



X+jx 



\+JX 



[9.4] 



( ) 



We know p Q (t) = 1 - p A (t) 
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Therefore for [9.2] 



c It 

?1 ' (t) = A - A pi CO 
p ^ 1 CO - A. ~ C a + /Op J CO 



Rearranging 

p t ' Ct) + ( x + 

As before 



Pi 



(O 



2 . +_M> 



U) - * 









[9,5] 



For p 0 (t) and Pi(t) when t-*°° we have the equilibrium 
distribution : 



JX 

XTjX 



[9,6] 



Pi 



X + 



[9,7] 



From the above development# we could determine the 
probability of the availability of the book, p* , and the 
probability that the book is unavailable, p^ (the probability 
that the book is charged out) . 



9.2 



Comments Concerning the Two Stage Book Use Uodel 

The model for the availability and unavailability 
of a book was developed on the assumption that the number 
of demands are Poisson distributed with mean \ and that 
the loan intervals are exponentially distributed with mean 
. The model further explicitly considered only known 
demands (book charges) as permissible uses and a book which 
was charged out was in state 1 apd unavailable whereas a 
book which was not charged out w as available. This latter 
point, however, does not consider unknown demands which 
are satisfied but for which the book is used in the 
library. Considered in this way, the p^ and p^ developed 
in Section 9.1 could be considered ass 

p Q — the probability that the book is 
demanded and not available or is 
demanded, is available and used in 
the library (i , e. , the probability 
the book is not charged out) . 

the probability that the book is 
charged out. 

Thus, we are sure of the probability that the book is 
charged out, p^, and we are sure of the probability that 
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the book is not charged out, p e . We are unsure, however, 
of the components of p e , namely, the probability that the 
book is used in the library or is unavailable. The latter 
probability, the probability of a book being used in the 
library is, however, very small when one considers the 
demand rates for books in libraries (such as they are) , 
since books used within libraries are generally off the 
shelf for at most a day whereas books charged out of the 
library are off the shelf for 3 weeks, a month, a semester 
or for a longer period of time. The P class demand data 
(Section 5,6) showed no day in which there were two demands 
for the same book. Thus, unavailability of books in a 
library is not caused by unknown satisfied demands but by 
known satisfied demands and the p 0 could be considered as 
described with little error. 



9.3 Use of the Two Stage Book Use Model for L oan Jnteival 
Establishment for a Book with a Known Demand Rate 

It is assumed that the demand rate for a book, 

Af is known and that the mean loan interval is the 
loan interval, 1/^, established by the library. 

The number of demands for the book is Poisson 



distributed with mean. A, above. The particular library 
can establish a goal as to the percent of the time that 



the library wishes to have the book available for a patron. 
Using [9.fi] and [P.7], it is possible to establish a loan 
interval for the book. Thus, if the probability of 



availability is: 



Then 



Since p A * 1 - p e 



^ _ JUL 

Pe “ X TIL 



UL 



, TV*- 

~t 



For example, if the library had as its goal an availability 



of 75?: and the book had a demand rate of 3 demands per 



year : 




JUL * S 



Thus, the loan interval for the book should be 1/9 year 
or approximately six weeks. This model assumes the A is 



known. 
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Use of the Two stage Book Use Model for Loan Interval 
Establishment for a Book with a Demand Rate Which is 
Unknown 

Using equations [9.6] and [9.7], it is possible to 
determine the mean rate of book charges and the mean rate 
of unsatisfied and unknown demands. Since a patron could 
charge out a book only if it is available, p e , and the 
demand rate is A , the mean number of persons charging out 
a book is p 0 A or 



If the 'number of charges, N, for a period of time and m. for 
the period of time were known, it is possible to estimate A 
by 




[9.0] 



M 



A 






X - 



X-rV 



[9.9] 



160 



For example, if a library had a loan interval, l^u, equal 
to 1 month (/i would be 12) which was charged out five times 



during the past year. 



X = 



we could estimate A as 

12.-5 



a - a.sq 



the availability of the book could be determined by substi- 
tuting [9.9] in [9,6] I 



To 






, 58 £ 



In this way, the book is estimated to be unavailable 
slightly more than 5S£ of the time. 



The problem in estimating the demand rate and 
unavailability from this approach lies in the fact that 
the charge rate must be estimated from the number of charges 
of the book for a period of time. Two problems exist with 
this approach: (1) as Jain (1967), Morse (1965) , and others 

have shown that book use diminishes as time increases. 



Since this is so, it is necessary to select a reasonably 
short period of recent book use history for determining 
N, As a result, (2) because of the small sample size, 
the confidence interval estimate for the mean is rather 
wide, Brownlee (1960 p« 140) shows that it is possible 
to establish confidence intervals for the population 
parameter for a Poisson distribution from an observation 
as follows? 



Where >L is the upper confidence interval for the population 
parameter , jj. l is the lower confidence interval for the 
population parameter, 1 2 is the cumulative probability 

at the upper limit, **/2 is the cumulative probability at 
the lower limit, x e is the observation and the term in 
brackets is the degrees of freedom. is the value of 

Chi Squared fro^i a Chi Squared table at the specified level 
of confidence and degrees of freedom. For example, in the 
problem given above, a library observing five charges in 




[9.10] 




[9,11] 



the past year would be PSP confident that the mean number 
of charges for the book is between 1.62 and 11.66 as 
calculated by; 



| [ 2 ( 5 * 1 )] 

-- | ( 23 . 3 ) 

= II. US 

n l -- i x*m*L«xs)1 
= A (3.25) 

- i.UZ 



It is thus possible from a single observation of the 
two stage book use process to establish confidence intervals 
for the mean number of uses of the book. A library could, 
if it wished, determine the percent unavailability of a 
book based on the methods developed in this section. 
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The Concept of Critical A 



There is a certain demand rate at which it will be 
impossible for the library to meet its availability goal. 
This investigator calls this demand rate the "critical A". 
Using equations [9.6] and [9.7], it is possible to determine 
the critical A. for a book based on the library's 
availability goal. Thus: 



M. 



A 

A 



At-Pi 



Since Pj - 1 * p e 



Thus, a 
a jx. of 12 per 



library with an availability goal of 15% and 
year would have a critical demand rate, A c ,of 



x = CUX.2&) 

-C — 

(. 76 s ; 






.76 
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Any book in the collection which would, have a demand rate 
higher than 4 would be unavailable more than 25£ of the 
time. Further, this book would have a critical charge 
rate N c (known demand rate) of p 6 ^- c an< ^ using [5.8] s 




C4)Cm 
iz t H 



5 



If a book in this library would have a charge rate higher 
than 3 per year, the library would not meet its 
availability goal. 



9.6 Using the Critical A Concept 

Based on the critical demand rate, , and the 
critical charge rate, U c , the library could annually 
statistically determine which books are not available more 
than lOOps* of the time in the following manner. The 
library wants to test a null hypothesis t'J = N c and in so 
doing is testing the hypothesis that A — . They do this 

by observing the number of charges , x e , which the book had 
in the past year and use the following test statistic i 

. l-P{rt2C V l)J<2N c } 



[9.12] 



2 

Where x a is the observed number of charges, X is the value 



of Chi Squared from a Chi Squared table, the term in 
brackets is the number of degrees of freedom, and N is the 
critical demand rate. To illustrate the use of the test, 
suppose a library had a jll - 12, p© - .75, = 4, N c = 3 

and observed x & = 7. We. calculate the probability of 
getting x - 7 by using [9.12] and thus: 



Thus the probability of observing 7 charges or less if 
the book had a charge rate of 3 is ,986. 

The library could further determine a critical number 
of charges. That is, that number of charges at which they 
would conclude that their availability goal is not being 
met. Suppose the library wants to determine the number 
of observed charges X c at which they are willing to conclude 
that the charge rate is equal to the critical charge rate. 
Using [9,12] , 



P{x£7| N = 3} = 1 

P{ % <7 | N -3} - 1 " - 01l + 
P{ 7 | N s 3} - .^86 




[9.13] 
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until the 



We calculate [9.13] # for a % c off 0, 1 , 2, ..... 
probability is at a level at which the library is willing 
to risk rejecting the hypothesis that N ~ II q when it is 
true. To illustrate the use of [9.13], assume that a 
library has a \ m 4, p e = .75, N c « 3 and is willing to 
take a 5H risk of rejecting the hypothesis that N = 3 
when it is true. We calculate [9.13] for x 0 ** 0, 1, 2, 

3, 4, and f ind : 



p 


{x< 0} 


=r 


,05 


p 


{x< 1} 


— 


.20 


p 


{x£2} 


- 


.43 


p 


{x<3} 


= 


,55 


p 


{x<4} 


= 


.92 


p 


{x< 5} 




.92 


p 


{x< 6} 


— 


.97 


p 




=: 


.99 



A library then which is trying to maintain the goals stated 
in this illustration and would not want to reject their 
hypothesis of an N c = 3 when it is true of *05 ( 1 — .95) 
would have to establish a critical number off charges, x £ , 
of 5 . A library using this critical number of charges 
concept could, upon observing 7 charges of a book, conclude 
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that the demand rate for the book is at least at the 
critical level 'k c and could adjust their charge rate. 

They could, for example, cut their loan interval in half. 

If jx were thus doubled in this manner, the new critical 
demand rate would be (using Section 9.5); 

a - ( ) (.23) 

■ c — — - — - 

(. 75 ) 

= 8 

The charge rates for the specific books could be adjusted 
on an annual (or periodic) basis in this manner. 

9.7 Establishing Confidence Intervals for Demand Rates 
for Books in a Collection 

One of the questions which could be answered from 
this investigation is, "what is the probability of a specific 
book in a collection having a demand rate exceeding some 
specific value?" Concluding, as has been done in Chapters 
6, 7 and 8, that the number of demands for a group of books 
is negative binomially distributed permits the conclusion 
that not only are the number of demands for specific books 



Poisson distributed but that the demand rates for these 



books vary according to a Gamma distribution. Using the 
Gamma distribution and collected known demand data (such 
as that of Section 6,1) or of total demand data (such as 
that of Section 6.2) it is possible to predict the 
probability of getting demand rates or charge rates of 
specified values. Prior to doing this, however, it will 
be worthwhile to look at the Gamma distribution and its 



peculiarities. From [3. 5], ,2. is distributed as 






Cf 



~c% 

JL 



r - A 



where the parameters are C and r. r is the shape parameter 
and G is the scale parameter. Changing C changes the scale of 
the two axes, When r = 1 the distribution becomes the 
exponential distribution. The cumulative distribution is 



Fa) = / 



rco 



J^ C *A r ’ l dL\ 



[9,14] 
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when A>G and is 0 when A4 O • The integral must be evaluated 
by numerical methods unless r is integer in which case 
F(^>) could be found by: 



Fa-) - i y 



Cca') 



L ! 



The r’s for the collected demand data are not integer. 

In order to calculate probabilities from this function, 
it is necessary to numerically calculate probabilities 
from [9 . 14] or use the tables of the Incomplete Gamma 
Function which were developed by Pearson (1965). The 
tables present tabular values of an I(v,p) function where: 

r v - v P . 

I JU v dL\t 

I<v,p> = — [9.15] 

J JL,'* v T d- ^ 



where ’the I(v,p) function always falls between 0 and 1. 
Pearson makes a further transformation because of the 
wide range of the value of the argument v. He uses 



V 

u = 



Vp + i 
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Pearson's tables 

I C s 



thus 




O 



really 




JU 



present 
P , 

V ttl/ 



JL 



-v 



v y dt.v' 



X(u,p) provides the probability of a deviation exceeding 
a certain size. 

In order to use Pearson's tables, it is necessary 
to transform [9.14] in the following way. Vie substitute 



P + 



for r and get s 



F C 



r c^_ 

c r ('prl) 





cL X 



[9.16] 



This transformation prevents a negative factorial in the 
gamma function. Vie then let u — a/ */ p+1 . We then gets 



1 (*,?') 



r 

Jo 



UL 






p+v ~CX T 
C JL* 1 



cU 



rcp+n 



j 

o 



90 



C^ 1 j L - C ^? T 



P(p-o 



oU 



o 
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which upon cancelling common terms becomes 

, ULl/p+i 



1 ( u.jO 



1 
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which is identical to [9.15] and we are prepared to use 
Pearson's tables. We illustrate their use for establishing 
a confidence interval for the P class demand data (Section 
6,2). We concluded that the number of demands for the 
sample of books in that class was negative binomially 
distributed with mean = .143*1*1 and variance » .20785. The 
parameters r and p were calculated to be .41276 and ,69011. 

We want to determine a one sided 95$ confidence interval 
for the variable X knowing that r = .4127*5. We consult 
Pearson's tables to find the value of u for which p * 1.41276 
and I(u,p) = ,05. We find that: 

I{.3,1.4) = .0386136 
£(.4,1.4) ^ ,0693255 
£(•3,1.5) = ,0334023 
£(.4,1.5) -■ ,0614995 

Using 2 way interpolation, we calculate u, for ,05 to be 
,33967. W 7 e transform u to X using the transformation 
X ss u s/p+1 and 1 is .5278. Since the A is the demand rate for 
only a 51 day period, we extend the rate, to a yearly basis, 
and say that we are 95$ confident that books in the P class 
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have derand rates, which do not exceed 3.7H per year . The 
methods of the previous sections could then be used to 
determine the consequences of such a demand rate, 

9, . 0 Comments and Constraints Concerning Use of the Demand 

Models 

The models and use of the models in this Chapter 
assume that the number of demands for a hook are Poisson 
distributed , the number of demands for a group of books 
is negative binonially distributed and that the demand 
rates vary from book to book according to a Gamma 
distribution. Pections 9,1 through 9,6 deal with a book 
in which it is assumed that a single copy of the book is 
in the collection. The models and their use do not apply 
to multiple copy books. The establishment of a confidence 
interval for the demand rate in Pection 9,7 does not assume 
that there is only one copy of the book available. Total 
demand is independent of the number of copies of the book 
which are available. Known demand (home use) is not. The 
models developed in 9.1 through 9.6 further assume that 
loan intervals are exponentially distributed. This 
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investigator was unable to verify that the loan intervals 
were exponentially distributed using 5UNY/Buffalo data 
because of the peculiar nature of the loan policy at that 
institution. The policy is that any book may be charged 
out for a semester loan subject to recall if it is 
requested. The minimum loan interval is three weeks. 

This investigator was unable to fit a known distribution 
to the CUNY/Buffalo loan interval data although, if 
anything, the data was possibly Gamma distributed although 
there was insufficient evidence to reach this conclusion. 
The SUNY/Buf f alo system assumes that a person needing a 
book will request it if it is unavailable. This is simply 
not the case. Virtually all of the respondents in the P 
class demand study did not ask that the book be saved or 
found when they did not find the book. The SUNY/Buf f alo 
loan interval data showed a mean of about one half of a 
semester* This would be expected in an institution which 
has a semester loan policy since we would expect half of 
the book charges to occur before and half after the 
midsemester point. Nevertheless, the developed models 
would be applicable to a library which had a fixed 
loan interval (? weeks, 5 weeks, a month, etc,). 
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A study 



performed at Purdue University* showed that the loan 
intervals for books at that library were, in fact, 
exponentially distributed with a mean equal to about the 
specified loan interval. We would conclude then that the 
models would be applicable in a typical library environment 
if the number of demands for a bock are Poisson distributed 
and the loan intervals are exponentially distributed. 



♦Informal communication from Edward O'Neill presently at 
the State University of New York at Buffalo, 
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CHAPTER 10 



SUMMARY OF RESULTS, OBSERVATIONS 
AND RE COMMENDATIONS FOR 
FURTHER WORK 



10,1 



Summary of Results and Observation 



In proposing this research, this investigator had 
two objectives* (1) to provide evidence that one cannot 
reject the hpothesis that the number of demands for books 
is Poisson distributed and (2) to develop a practical model 
with which it would be possible to predict demand based on 
known demand data generated in library circulation systems. 

An attempt was made to fulfill objective 1 by 
collecting data about the number of demands for books and 
determining how the demands for books were distributed. 

Since there is so little data available about the demand 
for any one book to statistically conclue the type of 
distribution, demands for groups of books were used. The 
investigation disclosed that the number of demands for groups 
of books appears to be negative binomially distributed. 
Reaching this conclusion establishes an experimental basis 
for concluding that the number of demands for individual 
books is Poisson distributed and that the demand rate varies 
from book to book according to a Gamma distribution. 
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Evidence of this was provided by using the number 
of demands for a large sample of P class books in the 
collection of the SUNY-Euf falo libraries and by using a 
large random sample of books from the entire StJNY— Buffalo 
library collection* Both these samples appeared to have 
Negative Binomially distributed demands, although the P class 
demand sample provided weaker evidence with the Chi Squared 
goodness of fit test. A further analysis of the number of 
demands per day for samples of P class demanded books showed 
that the demand per day for these samples was distributed 
according to Poisson distributions. As has been shown, the 
results of the analyses on all three sets of data permits the 
inference that the number of demands for individual books is, 
indeed, Poisson distributed. Using the sample from the entire 
collection permits the conclusion that not only do P class 
books exhibit Poisson distributed demand properties but that 
there is evidence that the demand for all books is Poisson 
distributed. To permit generalization of these results to 
other libraries, 18 samples of book use data in several 
subject areas at four other libraries showed that, for the 
most part, with the exception of economics books, the number 
of uses for groups of books at other libraries was Negative 
Binomially distributed permitting some generalization of the 
results to other subject areas and libraries. 
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Although this investigator wanted to develop a 
practical model for predicting demand for individual books 
(Objective 2) , the results here provide a good mechanism 
for predicting demand for collections of books. By accepting 
the Negative Binomial distribution as an adequate represen- 
tation of the distribution of demands for a collection of 
books, any library can, with the collection of demand 
information about a small sample of books, draw inferences 
about the demand for the entire collection. This was 
demonstrated in Chapter 9 using the Gamma Distribution. 

The investigation also showed that it is possible to predict 
the demand rate for a book by using an observation of the 
number of known demands for a period of time although the 
confidence interval estimate for the demand rates are somewhat 
broad, A more practical approach may depend on a library 
availability goal and the predetermination of a critical 
demand rate. Using the critical demand rate permits testing 
a hypothesis that the demand rate is at a critical level 
and provides an action point for* permitting library managers 
to alter their loan interval or add a copy of a book. The 
prediction of a demand rate as developed in this research is 
based on the assumption that only one copy of a book is 



available 



10.2 Recommendations for Further Work 

This research has merely presented a beginning for 
demand model development. The results presented here are 
based on Poisson distributed demand and exponentially dis- 
tributed loan intervals. Further work should be performed 
in order to determine the nature of the distribution of 
loan intervals so that models could be developed for 
predicting demand for other distributions of loan intervals. 
Investigations should also be performed for predicting demand 
based on the availability of multiple copies of a particular 
book. The effects of deviations, in practice, from Poisson 
distributed demand and exponentially distributed service 
times should also be examined, Such an investigation may 
show that it is not necessary to collect extensive loan 
interval or demand data because, perhaps, an assumption of 
a Poisson distribution for demand and an exponential 
distribution for loan intervals may be more than adequate for 
a library* s purpose in attempting to determine book availa- 
bility or unavailability. Further impetus is given to this 
recommendation by the fact that collecting data concerning 
demand for books in libraries is tedious, time-consuming 
and expensive. 
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APPENDIX A 



QUESTIONNAIRE FOR P CLASS BOOK DEMANDS 



Dot e Time 

THE RESULTS OF THIS QUESTIONNAIRE WILL HELP THE LIBRARY IMPROVE ITS SERVICE, 
YOUR COOPERATION WILL BE APPRECIATED. 

For ec>cb book which you looked Tor, found or used, lift the complete cell number on th® left 
and circle your answer to each qf the three questions on the right# Reese circle the a rawer 
to the question even ff you did not find the book you wanfod. 



Call Hunger 



Question 

1) Did you have the call number when 
you came into the stacks? 

2) If you had the call number, did you 
find the book you looked for? 



3) If you found (or would have found) the U»® R In the Library 

book will you (would you)? charge ft Out 

Not sure if I wHI(would) 
charge if out or not 



Answer 

YES 

NO 



Call Number Question Answer 

— — — — 1 ) Did you Have the call number when YES 

you came Into the stacks? 



2) If you had the call number, did you YES 

find the book you looked for. *iq 





3) If you found (or would liave found) the 


Use It In the Library 




book, will you (would y®i)? 


Charge It Out 






Not sure if 1 will(would) 
charge it out or not 


Call Number 


Question 

1) Did you hav j the calf number when 
you came into the stacks? 


Answer 

YES 

NO 










2) If you »;?d the call number, did you 
Rnd the book you looked for? 


YES 




NO 




3) If you found (or would have? found) the 


Use It In the Library 




book, will you (would you)? 


Charge it Out 



Not sure if I wil((weuid) 
charge it out or not 




Call Nu nfeer 



Question 



Answer 



I) Dfd you have the call number, when 
you cam* into the stacks? 



YES 



NO 



2) If you had the call number, did your 
find the book you talced for? 



YES 

NO 



3) If you found (or would have found) the 
book, will you (would you)? 



Use it In the Library 
Charge* It Out 

Not sure if I wltl(would) 
charge it out or not 
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appendix b 






INSTRUCTIONS TO DATA COLLECTORS 



1) Assign a patron number to a sheet, record the date and time and hand the sheet 
to the patron. Tel! him that If he needs additional sheets tha: they are available. 

Pdtrons are to be assigned numbers sequentially beginning In the morning of each 
day with "001 , 11 

NOTE: A patron Is to be assigned a number even if he refuses a sheet. The word 
"Refused" is to be written in the top right hand corner of the sheet of a patron 
who refuses to take one or who turns it in uncompleted. 

2) When the sheet Is turned In, the data collector Is to scan the sheet and 

a) If the sheet contains no call numbers beginning with "P", he Js to write 
the phrase, "Not Applicable" In the top right hand corner of the sheet. 

b) If the sheet contains call numbers beginning with "P" but call numbers 
are not complete; the handwriting Is illegible, or It Is clear that the 
patron did not answer the questions properly, the phrase "Can't Use" 

Is to be written on the sheet. 

3) After the patron has filled out and turned In the data sheet, th© data collector 
Is to enter the appropriate data on an 80 column data sheet. The following 
codes are to be used* 



Columns 

1-10 

11-20 

21-30 

31-40 



C los^Ifl cation Letters (6*9** PB, PA, PQ, P) 

Classification (e.gi P23, PB] 5, PQ2673, PA1) 

Second Line of Call Number (e.gi 04273/ 

T86, G5j G8G83) 

Third Line of Call Number If there Is one 



41-50 

51-60 

61&62 



63,64,65,66 



Fourth Line of Call Number 

Leave Blank 

Day of the Study (e*gi 15, 19, 63) 

Note: This Is not the day of the month. The 
first day of the study was day 01 , The 
last day Is day 70. 

The time of the day on the 24 hour clock. 

(e.g; 1213, 1516, 2202) 

Note: The 24 hour clock begins at midnight, one 

minute after midnight would be 0001, 1 iOOA.M* 
would be 0100, 3:15 in the afternoon would be 
1515, 8:00 P.M. would be 1600. 

Enter a "l" If the answer to question 1 was "YES.' 1 
Enter " 2 * If It was ,! N07' 
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71 



Columns 

72 Enter !I 1 11 If the answer to question 2 was "YES, 1 * 

Enter a ,I 2 M if the answer to question 2 was "NO, " 

73 Enter M 1 11 if the answer to question 3 was “Use It 

In the Library. " Enter a "2" if the answer was 
"Charge it Out.” Enter "3" if the answer was 
"Not sure If I wlll(would) charge ft out or not." 



74 Leave Blank 

73 Enter "1 11 if the patron's data sheet was useable. 

Enter "2" if the patron's data sheet contained no 
"P" classifications. Enter "3" If the data sheet 
was unuseable but contained " P" classifications* 
Enter "4" If the person refused to cooperate. Enter 
"5 " If the person browsed but used no books. 



78^9,80 The patron's number, 

4) The data collector Is to Initial each data sheet after he has transferred the information 
to the 80 column summary sheet. 

5) It Is Important to Print Clearly on the data sheet and to distinguish between the letter 
2 and the numeral 2, 2’s written as look like Z's. Therefore, always cross 
hatch the Z e.g. . Also the letter I sometimes looks like a 1 . Therefore, always 
write the I as X* ■ * et f® r O I* written as . 

Examples 

Attached Is a Xerox Copy of some entries on a sample 80 column summary sheet. 



Examples; 

1st Line 

Patron 001 listed the book 



PR 3142 
G 6 



Columns 

61-62 It was the 15th day of the study 
63-66 It was 7:20 A. M. 

71-73 His answers to the three questions were: ,, YES ll / "YES", and "Charge It Out 
75 His data sheet was useable 

2nd - 7th Li ne 

Patron 002 listed six books with the call numbers shown. 

Columns 

61-62 It was the 15th day of the study 
63-66 It was 8:22 A. M. 

71-73 His answers were- "YES", "NO", "Use In Library" 

75 Hfs data sheets were useable. 



8th Line 



Patron 005 refused to cooperate 
Column 

75 Coding "4 11 Indicates he refused to cooperate 
9th Line 

' Patron 003 turned In a data sheet which contained no “P" classification* He 
arrived on the 15th day at 1 1x20 A.M* 

10th Line 

fatron 004 turned In a useable data sheet with one "P" Item on ft and one "J" Item 
on It* No entry Is made for the "J" Item. The call number of the "P" Item wasi 

PQ56 
A123Z56 
Vol 2 
1965 

He arrived on the 15th day at 1x20 P.M. He dldn u answer the 1st and 2nd questions,, 
but answered the 3rd question, "Not Sure If I wll l(would) charge It out or not*” 

His data sheet was useable. 

11th Line 

Patron 006 turned In a data sheet which contained the following call number* 

PE42 

This Is not a complete call number and Is unuseable. He arrived at 3f 1 0 P*M* 
on the 1 5th day- 

6) The last data collector working each day will take the 80 column data sheets and the 
questionnaires for that day and put them all In a brown envelope. He will mark the 
number of the day on the envelope. The circulation data collector will pick this up 
the following morning. 

7) If In doubt ah^ut the correct day number 



Monday 


March 


4 — day 2 


ii 


ii 


11 * day 9 


if 


ii 


1 8 - day 16 


ll 


ll 


S ~ day 23 


II 


April 


1 = day 30 


II 


ii 


8 — day 37 


ft 


it 


15 s day 44 


II 


Ii 


22 - day 51 


ll 


ll 


29 ^ dev 58 


II 


May 


6 — day 65 


Saturday 


May 


1 1 day 70 (last day) 
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Circulation Data Collector 



1) pick up the 80 column summary sheet downstairs. 

2 ) From the circulation desk you will receive: 

a) A stack of "P" classification charge cards 

b) A list of "P" classification "saves" and "traces" 

e) A list of "P" classification saved books which were charged out. 

3) You are to compare the charge cards with the 80 column list. If the Item was 
charged out, you are to enter a "1 " In column 74. If It was not charged out 
you are to ©(nter a ,, 2 11 In column 74 * 

4 ) You will probably have some cards for books which were not I I^d on the 80 c ° lu J n 
summary sheet. If you do, fill out a small data sheet far each book. Do not enter 
any Information on the 80 column summary sheet. 

5) You should then have: 

a) 80 column summary sheets 

b) A list of "P" classification saves and traces 

e) A list of 11 P H classification saved books which were charged out 

i) 

6) Put all the Information In a brown envelope and deliver to Ralph Hall . 
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SAMPLE 






^ ' 

\ 




l 

i 
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APPENDIX C 



PUNCHED CARD FORMAT 



Columns 



1 - 


4 


Classification letter 


11 - 


20 


Classification 


21 - 


30 


2nd line of call number 


31 - 


40 


3rd line of call number 


41 - 


50 


4th linf of call number 


51 - 


60 


Last name or student number of patron 


61 - 


62 


Day of the study 


63 - 


66 


Time of the day 


71 




1 if patron had call number 

2 if patron did not have call number 


72 




1 if desired book was found 

2 if desired book was not found 


73 




1 if patron said he would use it in 



the library 

2 if patron said they would charge 
it out 



74 

75 



3 if patron did not know 

1 if book was charged out 

2 if book was not charged out 

0 if book was charged out and did not 
appear on the questionnaire 

1 if the questionnaire was useable 

2 if no P ' s on questionnaire 

3 if the questionnaire was not useable 

4 if questionnaire was refused 

5 if patron was a browser and found 
nothing 

6 if request that a book be saved 

7 if request that a book be traced 

8 if a book from the save shelf was 
charged out 

9 if a book v/as charged out but not 
recorded on the questionnaire 



O 

ERIC 



174 



APPENDIX D 



TABLE USED TO CALCULATE CH^ SQUARED 
FOR FITTING THE NEGATIVE BINOMIAL 
DISTRIBUTION TO THE SHELF LIST SAMPLE 
FROM THE SUNY-BUFFALO COLLECTION 



The 'table in this appendix shows the cell value 
cell frequency, theoretical probability, theoretical 
cell frequency, difference between the observed cell 
frequency and theoretical frequency, the square of the 
difference, and the component of Chi Squared for each 
cell for the shelf list sample data in Chapter 6. 
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r- c\j n rr in kd 



TABLE D . 1 



CHI SQUARED TEST TABLE FOR KNOWN DEMAND SAMPLE 
COLLECTED FOR THE SUNY - BUFFALO COLLECTION 
DURING THE PERIOD SEPTEMBER 1, 1966 - AUGUST 31, 1967 

OBSERVED THEORETICAL CHI 



VALUE FREQUENCY PROBABILITY 



530 


,74370 


112 


.16345 


42 


,05612 


9 


.02158 


4 


.00874 


1 


.00365 


4 


,00276 


702 


1 . 00000 



FREQUENCY 


DIFFERENCE 


SQUARED 


522.077 


7.9226 


.1202 


114.742 


-2.7419 


.0655 


39.396 


2.6038 


,1721 


15.149 


-6.1492 


2,4960 


6.135 


-2,1355 


.7433 


2.562 


-1 . 5623 


.9526 


1 .938 


2,0625 


2. 1955 


702.000 


,0000 


6.7452 




i ' 
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APPENDIX E 

TABLE USED TO CALCULATE CHI SQUARED 
FOR THE VARIANCE TEST ON THE SHELF LIST 
SAMPLE FROM THE SUNY-BUFFALO COLLECTION 



The table in this appendix shows the cell 
value, cell frequency, the square of the difference 
of the cell value and mean, the frequency of occurrence 
of the cell value multiplied by the difference, and the 
component of Chi Squared for the cell for the shelf 
list sample in Chapter 6. 



o 
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TABLE E, 1 



VARIANCE TEST TABLE FOR KNOWN DEMAND SAMPLE 
COLLECTED FOR THE 5UNY - BUFFALO COLLECTION 
DURING THE PERIOD SEPTEMBER 1, 1966 - AUGUST 31 , 



i 


f X 


2-* 

(i - m) 


f^(i - m) 


f^(i - m) 


0 


530 


,145 


76,935 


201,930 


1 


112 


.383 


42.914 


112.635 


2 


42 


2.621 


110.089 


288,947 


3 


a 


6,859 


61,732 


162.027 


4 


4 


13,097 


52,389 


137,503 


5 


1 


21,335 


21,335 


55,998 


6 


3 


31,573 


94,719 


248.608 


7 


1 


43.811 


43,811 


114,990 



1322,638 




13& 



1967 

/m 
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APPENDIX F 



TABLE USED TO CALCULATE CHI SQUARED 
FOR FITTING THE NEGATIVE BINOMIAL 
DISTRIBUTION TO THE SAMPLE OF P CLASS BOOKS 



The following table shows the cell value , 
cell frequency, theoretical probabilities, theoretical 
cell frequencies, the difference between the observed 
and theoretical cell frequencies, and the component of 
Chi Squared for each cell for the P class demand data 
in Chapter 7, 




179 




TABLE F * 1 

CHI SQUARED TEST TABLE FOR P CLASS SAMPLE 
COLLECTED FOR THE SUNY - BUFFALO P CLASS 
COLLECTION DURING THE PERIOD 
MARCH 3, 1968 - MAY 11 r 1968 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


650 


,85805 


628.093 


21.9074 


.7641 


1 


65 


.10975 


80.337 


-15,3370 


2,9280 


2 


13 


,02402 


17,583 


- 4.5826 


1.1944 


3 


2 


,00599 


4.385 


-2.3847 


1,2969 


4 


2 


.00219 


1.603 


,3969 


.0983 




732 


1,00000 


732.000 


,0000 


6.2817 
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APPENDIX G 

TABLE USED TO CALCULATE CHI SQUARED 
FOR THE VARIANCE TEST ON THE 
1 RANDOM SAMPLE OF P CL^S BOOKS 



The following table shows the cell value, 
cell frequency, the square of the difference 
between the cell value and the mean, the frequency 
of the cell value multiplie 1 by the difference, 
and the component of Chi Squared for the cell for 
the data in Chapter 6 . 



TABLE G, 1 



VARIANCE TEST TABLE FOR P CLASS DEMAND SAMPLE 
COLLECTED FOR THE SUNY « BUFFALO P CLASS COLLECTION 
DURING THE PERIOD MARCH 3, 1968 - MAY 11, 1968 



f • (i - m) 



f /. (i 



m) 



**Vm 



i f ■ 

Ay 

0 650 

1 65 

2 13 

3 2 

4 2 



(i - m)^ 

,020 

,734 

3.448 

8.162 

14,876 



13.292 

47.739 

44.830 

16.325 

29,753 



92.950 

333.840 

313,495 

114.160 

208,062 



1062.508 
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APPENDIX H 



\ 



TABLES USED TO CALCULATE CHI SQUARED 
FOR FITTING THE POISSON DISTRIBUTION TO THE 
RANDOM SAMPLES OF P CLASS DEMANDED BOOKS 



The following tables show the cell value, 
cell frequency, theoretical probabilities, theoretical 
cell frequencies, the difference between the observed 
and theoretical cell frequencies, the square of the 
difference, and the component of Chi Squared for each 
cell for the data in Chapter 7* 
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TABLE H, 1 



CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 1 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , I960 - MAY 11, 1968 





OBSERVED 




THEORETICAL 




CHI 


VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


2 


.07226 


3,685 


=1,6853 


,7707 


1 


9 


,18986 


9.683 


-.6829 


.0482 


2 


16 


.24943 


12.721 


3.2791 


,8452 


3 


13 


.21845 


11.141 


1.8591 


,3102 


4 


6 


.14349 


7.318 


-1.3180 


.2374 


5 


2 


.07540 


3,845 


-1.8454 


.8856 


6 


3 


.05111 


2.607 


.3934 


.0594 




51 


1.00000 


51.000 


.0000 


3,1566 



TABLE H, 2 

CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 2 





OF 


P CLASS DEMANDS 


FOR THE 


PERIOD 








MARCH 3, 1968 - 


MAY 11 


, 1968 






OBSERVED 


THEORETICAL 




CHI 


VALUE 


FREQUENCY 


PROBABILITY FREQUENCY 


DIFFERENCE 


SQUARED 


0 


5 


.08128 


4.145 


.8547 


.1762 


1 


14 


.20400 


10.404 


3,5960 


1.2429 


2 


10 


,25600 


13.056 


-3,0560 


.7153 


3 


9 


,21417 


10,923 


-1.9227 


.3384 


4 


6 


.13438 


6.853 


-.8534 


.1063 


5 


3 


.06745 


3,440 


-.4399 


.0563 


6 


4 


.04272 


2.179 


1.8213 


1.5225 




51 


1.00000 


51,000 


.0000 


4.1579 
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TABLE H.3 



CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO 'SAMPLE 3 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 



VALUE 

0 

1 

2 

3 

4 

5 

6 
7 



OBSERVED 




THEORETICAL 




CHI 


FREQUENCY 


PROBABILITY 


FREQUENCY 


difference 


SQUARED 


4 


.05384 


2,746 


1,2542 


.5728 


10 


.15732 


8.023 


1.9767 


.4870 


13 


.22981 


11.720 


1,2797 


,1397 

* . - . 


6 


.22380 


11,414 


-5,4138 


2,5679 


7 


.16346 


8.336 


-1.3365 


.2143 


7 


.09551 


4.871 


2.1290 


.9305 


2 


,04650 


2.371 


-.3715 


,0582 


2 


.02976 


1.518 


.4822 


.1532 


51 


1.00000 


51.000 


,0000 


5.1236 



TABLE H.4 

CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 4 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 



VALUE 



OBSERVED THEORETICAL 

FREQUENCY PROBABILITY FREQUENCY DIFFERENCE 



CHI 

SQUARED 



0 

1 

2 

3 

4 

5 

6 



7 
14 
10 

8 
6 
4 
2 



,10696 


5.455 


.23908 


12.193 


.26721 


13.628 


.19910 


10.154 


*11126 


5.674 


,04974 


2.537 


.02665 


1.359 



1.5450 .4376 

1.8069 .2678 

-3.6277 .9657 

-2.1541 ,4570 

.3257 .0187 

1.4633 .8440 

.6409 .3022 



51 



1.00000 



51.000 



.0000 



3.2930 
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TABLE H , 5 



CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 5 
OF P CLASS DEMANDS FOR THE PERIOD 







MARCH 3, 


1968 - MAY 11 


, 1963 






OBSERVED 




THEORETICAL 




CHI 


VALUE 


FREQUENCY 


PROBABILITY FREQUENCY 


DIFFERENCE 


SQUARED 


0 


8 


.08791 


4.483 


3.5166 


2.7583 


1 


12 


.21375 


10.901 


1.0987 


.1107 


2 


8 


.25986 


13,253 


-5.2529 


2,0820 


3 


1 1 


.21060 


10.741 


.2594 


.0063 


4 


7 


.12801 


6.529 


.4715 


,0341 


5 


1 


.06225 


3.175 


-2.1747 


1,4897 


6 


4 


.03762 


1.919 


2.0814 


2.2579 




51 


1.00000 


51.000 


.0000 


8,7390 



TABLE H, 6 

CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 6 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1363 - MAY 11 # 1968 



VALUE 


OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CHI 

SQUARED 


0 


8 


.09697 


4.945 


3.0545 


1,8866 


1 


10 


,22626 


11.539 


-1.5393 


,2053 


2 


11 


.26397 


13.462 


-2.4625 


,4504 


3 


10 


.20531 


10.471 


-.4708 


,0212 


4 


5 


. 1 1976 


6.108 


-1.1078 


,2009 


5 


6 


,05589 


2,850 


3,1496 


3,4802 


6 


1 


,03184 


1,624 


-.6238 


.2397 




51 


1,00000 


51,000 


.0000 


6.4843 
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0 

1 

2 

3 

4 

5 

6 

L 

0 

1 

2 

3 

4 

5 

6 



TABLE H. 7 



CHI SQUARED GOODNESS OF PIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 7 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 



OBSERVED THEORETICAL CHI 

FREQUENCY PROBABILITY FREQUENCY DIFFERENCE SQUARED 



7 


.09889 


5 


10 


.22880 


1 1 


13 


.26470 


13 


11 


.20414 


10 


5 


.11808 


6 


3 


.05464 


2 


2 


.03075 


1 



043 


1.9566 


.7591 


669 


-1.6688 


.2387 


500 


-.4997 


.0185 


411 


.5889 


.0333 


022 


-1.0221 


.1735 


787 


.2134 


.0163 


568 


.4317 


.1189 



51 



1.00000 



51.000 .0000 1,3582 



TABLE H, 8 



CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 8 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3, 1968 - MAY 11, 1968 

OBSERVED THEORETICAL CHI 

FREQUENCY PROBABILITY FREQUENCY DIFFERENCE SQUARED 



9 

7 

13 

11 

2 

7 

2 


.08791 

.21375 

.25986 

,21060 

,12801 

,06225 

,03762 


4.483 

10,901 

13,253 

10,741 

6.529 

3.175 

1.919 


4.5166 

-3,9013 

-.2529 

.2594 

-4.5285 

3,8253 

,0814 


4,5500 

1.3961 

.0048 

.0063 

3.1412 

4.6090 

,0035 


51 


1,00000 


51.000 


,0000 


13.7109 
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TABLE H,9 



CHI SQUARED GOODNESS OF FIT TABLE FOR FITTING 
THE POISSON DISTRIBUTION TO SAMPLE 9 
OF P CLASS DEMANDS FOR THE PERIOD 
MARCH 3 , I960 - MAY 11, 196 3 

OBSERVED THEORETICAL CHI 

VALUE FREQUENCY PROBABILITY FREQUENCY DIFFERENCE SQUARED 



0 


6 


,08965 


4.572 


1 ,4279 


.4459 


1 


11 


,21623 


1 1,028 


-.0277 


,0001 


2 


12 


.26075 


13,298 


-1.2982 


.1267 


3 


9 


,20962 


10,691 


-1.6906 


.2674 


4 


7 


,12638 


6,445 


.5546 


,0 477 


5 


3 


.06096 


3.109 


-.1090 


.0038 


6 


3 


,03641 


1.857 


1,1431 


,7037 




51 


1,00000 


51.000 


.0000 


1.5953 







TABLE H. 1 0 








CHI SQUARED GOODNESS 


OF FIT TABLE 


FOR FITTING 






THE 


POISSON DISTRIBUTION TO SAMPLE 10 






OF 


P CLASS DEMANDS FOR THE 


PERIOD 








MARCH 3, 1968 - MAY 11, 


1968 






OBSERVED 




THEORETICAL 




CHI 


VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


7 


.09324 


4.755 


2.2448 


1.0597 


1 


9 


,22122 


11,282 


-2,2822 


.4617 


2 


14 


.26243 


13.384 


.6161 


,0284 


3 


10 


,20754 


10.585 


-.5845 


.0323 


4 


6 


.12310 


6,278 


-.2781 


.0123 


5 


1 


,05841 


2.97S 


-1.9709 


1.3146 


6 


4 


.03406 


1.737 


2.2629 


2.9480 




51 


1.00000 


51,000 


.0000 


5.8569 
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APPENDIX I 



TABLES USED TO CALCULATE CHI SQUARED 
FOR THE VARIANCE TEST ON THE RANDOM 
SAMPLES OF P CLASS DEMANDED BOOKS 



The following tables show the cell value , 
cell frequency, the square of the difference between 
the cell value and the mean, the frequency of the 
cell value multiplied by the difference, and the 
component of Chi Squared for the cell for the data 
in Chapter 7, 
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TABLE I , 1 



i 

0 

1 



VARIANCE TEST TABLE FOR SAMPLE 1 
OF THE P CLASS DEMANDS DURING 
THE PERIOD MARCH 3, 1968 - MAY 11, 1968 





(i - m)^ 


f^(i - m) 7 ^ 


L. (i * 


2 


6.903 


13.807 


5,255 


9 


2.649 


23,837 


9,072 


16 


. 394 


6.299 


2.397 


13 


.139 


1 , 804 


.687 


6 


1 . 884 


11,303 


4.302 


2 


5.629 


1 1.258 


4,285 


2 


1 1.374 


22.748 


8.658 


1 


19.119 


19.119 


7.277 




41,933 



TABLE 1.2 



VARIANCE TEST TABLE FOR SAMPLE 2 
OF THE P CLASS DEMANDS DURING 



THE 


PERIOD MARCH 3, 


1968 - MAY 1 1 , 


1968 




(i - m) 7 "' 


f^(i - m) 3 *^ 


f x (i ■ 


5 


6.299 


31,495 


12,549 


14 


2.279 


31,913 


12,715 


10 


.260 


2,599 


1,036 


9 


.240 


2,163 


,862 


6 


2.221 


13.324 


5.309 


3 


6.201 


18.603 


7.412 


2 


12.181 


24,363 


9.707 


1 


20.162 


20.162 


8.033 


0 


30.142 


0,000 


0.000 


1 


42.123 


42,123 


16,783 








74.406 
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TABLE 1,3 





THE 


VARIANCE TEST T7 
OF THE P CLASS 
PERIOD MARCH 3, 


,BLE FOR SAMPLE 
DEMANDS DURING 
1968 - MAY 1 1 , 


3 

1968 


i 


Cl 


(i - m)*^ 


f*(i - m) 2 ^ 


U (i - 


0 


4 


8.536 


34.142 


1 1.686 


1 


10 


3.692 


36.924 


12.638 


2 


13 


, 849 


1 1.041 


3.779 


3 


6 


.006 


.037 


.013 


4 


7 


1 .163 


8.141 


2.787 


5 


7 


4.320 


30.239 


10.350 


6 


2 


9.477 


18.954 


6.487 


7 


1 


16.634 


16.634 


5.693 


8 


0 


25.791 


0.000 


0.000 


9 


0 


36.947 


0.000 


0.000 


10 


0 


50.104 


0.000 


0.000 


11 


0 


65.261 


0.000 


0.000 


12 


0 


82.418 


0.000 


0.000 


13 


1 


101.575 


101.575 


34.767 



88,202 



TABLE 1.4 



0 

1 

2 

3 

4 





VARIANCE TEST TABLE FOR SAMPLE 


4 




OF THE P CLASS 


DEMANDS DURING 




THE 


PERIOD MARCH 3, 


1968 - MAY 11 , 


1968 




(i “ m) 


f^(i - m} 2 *', 


f U {± " ; 


7 


4.997 


34.976 


15.647 


14 


1,526 


21.363 


9.557 


10 


,055 


.554 


.248 


8 


.585 


4.678 


2,093 


6 


3.114 


18.685 


8.359 


4 


7.644 


3^0 .574 


13.678 


2 , 


14.173 


28.346 


12.681 








62.263 



m) /m 
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TABLE 1,5 



VARIANCE TEST TABLE FOR SAMPLE 5 
OF THE P CLASS DEMANDS DURING 





THE 


PERIOD MARCH 3, 


1968 - MAY 1 1 , 


1968 


i 


VC 


(i - m)*" 


f -(i - 

A, 


f •(! - m) /m 
■ A~ 


0 


8 


5.912 


47.292 


19,451 


1 


12 


2,049 


24.586 


10,112 


2 


8 


.186 


1.489 


.612 


3 


1 1 


,323 


3.557 


1.463 


4 

5 


7 


2.461 


17,224 


7.084 


1 


6,598 


6.598 


2,714 


6 


1 


1 2 , 73 c 


12.735 


5.238 


7 


1 


20.8*/ 


20,872 


8,585 


8 


1 


31.010 


31,010 


12.754 


9 


1 


43 * 1 47 


43,147 


17.746 

85,758 







TABLE I 


.6 








VARIANCE TEST TABLE FOR SAMPLE 


6 






OF THE P CLASS 


DEMANDS DURING 






THE 


PERIOD MARCH 3, 


1968 - MAY 11, 


1968 


i 


S* 


(i - m) 


£^<i - m) 


f *(i - m)^7m 


0 


8 


5,444 


43,555 


18.667 


1 


10 


1.778 


17,778 


7,619 


2 


1 1 


.111 


1 „ 222 


.524 


"k 


10 


. 444 


4.444 


1.905 


4 


5 


2,778 


13.889 


5,952 


5 


6 


7,111 


42.667 


18.286 


6 


0 


13.444 


0,000 


0,000 


7 


1 


21,778 


21.778 


9.333 



62,286 
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TABLE 1.7 



0 

1 



VARIANCE TEST TABLE FOR SAMPLE 7 



THE 


PERIOD MARCH 3 , 


1968 - MAY 11 , 


1968 


Cc 


(i - m 


f v(i - m) 




7 


5.353 


37.473 


16.196 


10 


1,726 


17.259 


7.459 


13 


.098 


1.280 


.553 


1 1 


,471 


5.181 


2.239 


5 


2,844 


14.218 


6 . 145 


3 


7.216 


21.648 


9,356 


1 


13,589 


13.589 


5,873 


0 


21,961 


0.000 


0.000 


1 


32.334 


32,334 


13.975 






61 ,796 



O 
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TABLE 1.8 
VARIANCE TEST TABLE FOR SAMPLE 8 





OF THE P 


CLASS 


DEMANDS DURING 




THE 


PERIOD MARCH 3, 


1968 - MAY 1 1 , 


1968 




(i - m) 




f^-(i - m) 5 ^ 


u lL - ' 


9 


5.912 




53,204 


21.882 


7 


2.049 




14.342 


5.899 


13 


.186 




2.419 


.995 


11 


.323 




3.557 


1.463 


2 


2,461 




4.921 


2.024 


7 


6.598 




46.185 


13,995 


0 


12.735 




0.000 


0.000 


1 


20.872 




20.872 


8.585 


0 


31.010 
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0,000 


0,000 

59.843 
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TABLE 1,9 



VARIANCE TEST TABLE FOR SAMPLE 9 
OF THE P CLASS DEMANDS DURING 





THE 


PERIOD MARC! 


i 




(i - m) 


0 


6 


5,817 


1 


11 


1.993 


2 


12 


.170 


3 


9 


. 346 


4 


7 


2.523 


5 


3 


6.699 


6 


3 


12.875 



1968 - MAY 11, 


1968 




fU’U - 1 


34.900 


14,471 


21.924 


9.090 


2.035 


,844 


3.114 


1.291 


17.658 


7.321 


20.097 


8,333 


33.626 


16,016 




57,366 



TABLE 1.10 

VARIANCE TEST TABLE FOR SAMPLE 10 
OF THE P CLASS DEMANDS DURING 





THE 


PERIOD MARCH 3, 


1968 - MAY 11, 


1968 


i 


Sc 


(i - m)^ 




u a - 1 


0 


7 


5,629 


39,403 


1 6 u 6 0 8 


1 


9 


1.884 


16,955 


7.146 


2 


14 


.139 


1.943 


.819 


3 


10 


.394 


3.937 


1,659 


4 


6 


2,649 


15.892 


6.698 


5 


1 


6,903 


6,903 


2.910 


6 


3 


13,158 


39.475 


16.638 


7 


1 


21,413 


21,413 ] 


9,025 



61.504 



( 



O 

ERIC 
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APPENDIX J 



TABLES USED TO CALCULATE CHI SQUARED 
FOR FITTING THE NEGATIVE BINOMIAL DISTRIBUTION 
TO THE FUSSLER AND SIMON BOOK USE DATA 



The following tables show the cell value, cell 
frequency, theoretical probabilities, theoretical cell 
frequencies, the difference between the observed and 
theoretical cell frequencies, the differences squared, 
and the component of Chi Squared for each cell for the 
data in Chapter 8 , 





TABLE J.1 



CIII SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE PHILOSOPHY SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 



VALUE 


observed 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CHI 

SQUARED 


0 


85 


,66136 


87.300 


-2,2995 


,0606 


1 


21 


.12826 


16.930 


4.0697 


,9783 


2 


12 


.06562 


8.662 


3.3382 


1,2865 


3 


3 


.04053 


5.350 


-2. 3500 


1,0327 


4 


4 


.02717 


3.586 


.4136 


,0 477 


5+ 


7 


.07706 


10,172 


-3.1719 


,9891 




132 


1.00000 


132.000 


. 0000 


4.3943 



TABLE J» 2 

CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE PHILOSOPHY SAMPLE AT YALE 

OBSERVED THEORETICAL 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


90 


.69357 


91.5512 


-1 .5512 


,0263 


1 


18 


.10983 


14. £'975 


3.5025 


. 8 <'• f 2 


2 


7 


.05616 


7.4131 


-.4131 


.0230 


3 


3 


.03532 


4,6622 


-1 .6622 


. 5926 


4 


6 


.02429 


3.2063 


2,7937 


2 .££342 


5+ 


8 


,08083 


10,6695 


-2,6695 


.6679 




132 


1.00000 


132.000 


,0000 


4.5902 



TABLE J.3 



CHI SQUARED TABLE FOP. FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 

FOR THE TEUTONIC LANGUAGES SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 



OBSERVED THEORETICAL CITI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


109 


,74362 


110.799 


-1.7994 


,0292 


1 


19 


.09162 


13.651 


5.3486 


2,0956 


2 


7 


.04592 


6.842 


.1579 


.0036 


3 


5 


.02880 


4.291 


.7088 


,1171 


4 


2 


.01988 


2,962 


-.9621 


,3125 


5 + 


7 


.07016 


10. 454 


-3.4538 


1,1411 




149 


1.00000 


149.000 


.0000 


3,6992 



TABLE J.4 

CHI SQUARED TABLE FOP. FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE TEUTONIC LANGUAGES SAMPLE AT YALE 





OBSERVED 




THEORETICAL 




CITI 


VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


112 


,76432 


113.804 


-1.8837 


,0312 


1 


22 


.09542 


14.218 


7.7824 


4,2599 


2 


6 


.04574 


6.815 


-.8153 


,0975 


3 


3 


.02733 


4.072 


-1.0722 


.2823 


4 


1 


.01795 


2.675 


-1 .6745 


1 , 8484 


5+ 


5 


.04924 


7.337 


-2.3368 


.74 43 




149 


1.00000 


149.000 


.0000 


6.4636 
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TABLE J. 5 



CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLER OF BOOK USE 

FOR THE TEUTONIC LANGUAGES SAMPLE AT CHICAGO 
THEN COMPARED WITH NORTHWESTERN 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


39 


.64548 


39.374 


-.3743 


,0036 


1 


5 


.10277 


6.269 


-1 .2600 


,2569 


2 


3 


.05547 


3.384 


-.3837 


,0435 


3 


1 


.03698 


2,256 


-1.2558 


.6991 


4 


4 


.02700 


1.647 


2.3530 


3, 3616 


5 + 


9 


.13230 


8.070 


.9297 


,1071 




61 


1.00000 


61 .000 


. 0000 


4.4717 



TABLE J, 6 

CIII SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 

FOR THE TEUTONIC LANGUAGES SAMPLE AT NORTHWESTERN 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


46 


.76059 


46.396 


-.3960 


.0034 


1 


7 


.09633 


5.876 


1.1239 


.2150 


2 


2 


,04635 


2.327 


-.8273 


. 2421 


3 


3 


.02773 


1.695 


1.3054 


1.0056 


4 


1 


.01329 


1,116 


-.1157 


.0120 


5 + 


2 


,05066 


3,090 


-1 .0^03 


, 3846 




61 


1,00000 


61.000 


, 0000 


1.8627 
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O 
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TAB 



LE J, 7 



CIII SQUARED TARjLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 

FOR THE TEUTONIC LANGUAGES SAMPLE AT CHICAGO 
WHEN COMPARED WITH CALIFORNIA 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


125 


,75067 


126,863 


-1.8632 


.0274 


1 


23 


.08872 


14.994 


8.0063 


4,2752 


2 


-> 


.04439 


7.502 


-5.5019 


4.0351 


3 


8 


,02787 


4,710 


3,2900 


2.2981 


4 


1 


.01927 


3.257 


-2,2566 


1.5637 


5+ 


10 


,06908 


11,675 


-1,6745 


.2402 




169 


1,00000 


169,000 


.0000 


12.4396 



TABLE J.8 

CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 

FOR THE TEUTONIC LANGUAGES SAMPLE AT CALIFORNIA 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


125 


.74685 


126.218 


-1.2176 


.0117 


1 


21 


.10427 


17.622 


3.3784 


.6477 


2 


7 


.05002 


8.4 53 


-1.4534 


. 2499 


3 


7 


.02967 


5.014 


1.9858 


,7864 


4 


3 


,01928 


3.258 


-.2583 


.0205 


5 + 


6 


,04991 


8.435 


-2,4348 


.7028 




169 


1.00000 


169.000 


. 0000 


2.4190 




TABLE iT. 9 



CHI SQUARED TABLE FOR PITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OP BOOK USE 
FOR THE BIOLOGY SAMPLE AT CHICAGO 
MIEN COMPARED T7ITII YALE 



VALUE 


OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 
l FREQUENCY 


DIFFERENCE 


CUX 

SQUARED 


0 


81 


,71655 


02.403 


-1.4032 


,0239 


1 


10 


.07755 


8,918 


1,0017 


,1312 


2 


4 


.04006 


4.699 


-.6909 


.1040 


3 


7 


.02724 


3,133 


3.8674 


4.7746 


4 


2 


.02005 


2,306 


-.3057 


, 0405 


5 + 


1 1 


.11775 


13,541 


-2.5412 


,4769 




1 1 5 


1,00000 


115.000 


,0000 


5,5511 





TABLE J.10 

CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE BIOLOGY SAMPLE AT YALE 

OBSERVED THEORETICAL 


CPI 


VALUE 


FREQUENCY PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


89 


,78055 


89.763 


-.7632 


.0065 


1 


7 


. 05782 


6.649 


,3507 


,0185 


2 


3 


.02988 


3.436 


-.4362 


,0554 


3 


3 


.01986 


2.284 


.7161 


,2245 


4 


2 


,01466 


1.686 


.3141 


.0585 


5 + 


11 


.09723 


11.181 


-.1814 


.0029 




115 


1.00000 


115.000 


.0000 


.3664 



ERIC 
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TABLE J.11 



CHI SQUARED TABLE POP FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR TIIE BIOLOGY SAMPLE AT CHICAGO 
WHEN COMPARED WITH CALIFORNIA 



VALUE 


OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CIII 

SOTJARFD 


0 


72 


.68776 


72.903 


-.9026 


.0112 


1 


17 


. 12406 


13,150 


3.8496 


1 . 1 269 


2 


7 


.06181 


6.552 


. 4 481 


. 8307 


3 


5 


.03735 


3.959 


1.0409 


.2737 


4 


0 


.02455 


2.602 


-2.6023 


2.6023 


5+ 


5 


.06447 


6.834 


-1 ,8338 


. 4921 




106 


1 . 00000 


106,000 


. 0000 


4.5368 









TABLE J. 12 










CHI SQUARED 


TABLE FOR FITTING 








THE NEGATIVE BINOMIAL DISTRIBUTION 








TO SAMPL 


ES OF BOOK US 


‘B 








FOR THE BIOLOGY 


SAMPLE AT CALIFORNIA 






OBSERVED 


THEORETICAL 




CHI 


VALUE 


FREQUENCY PROBABILITY 


FREQUENCY 


DIFFFRFTTCE 


SOTJARFD 


0 


59 


.57919 


61.394 


-2.3941 


, 0934 


1 


18 


.14443 


15.310 


2.6904 


,4728 


2 


10 


.07880 


8.353 


1.6472 


. 3248 


3 


5 


.05077 


5.382 


-.3816 


,0271 


4 


6 


,03521 


3.732 


2.2677 


1,3779 


5+ 


8 


,11160 


11,830 


-3.8296 


1 , 2398 




106 


1.00000 


106.000 


. 0000 


3.5357 



201 

820 



o 



TABLE J. 1 3 



CUT SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE PHYSICS SAMPLE AT CHICAGO 
WHEN COMPARED WITH NORTHWESTERN 



VALUE 


OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CHI 

SQUARED 


0 

1 

2 

3 

4 

5+ 


28 

14 

9 

4 

6 

20 


,42207 

,12089 

,07420 

.05365 

,04171 

,28748 


34,188 

9.792 

6.010 

4,346 

3.379 

23,286 


-6.1877 

4.2079 

2,9898 

-.3456 

2.6215 

-3.2859 


1.1199 

1 .8082 
1.4873 
.0275 
2,0341 
.4637 




81 


1.00000 


81 .000 


,0000 


6,9407 








TABLE J. 14 







CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE PHYSICS SAMPLE AT NORTHWESTERN 





OBSERVED 




THEORETICAL 




CITI 


VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


f) 


44 


.55442 


44,908 


-.9000 


,0184 


i 


13 


,14421 


11 .681 


1.3190 


,1489 


i 


4 


.08052 


6.522 


-2.5221 


.9753 


3 


3 


.05297 


4.291 


-1 .2906 


.3882 


4 

5+ 


3 


.03748 


3,036 


-.0359 


,0004 


14 


,13040 


10,562 


3.4376 


1,1188 


O 

ERIC 


81 ■ 


1,00000 

202 

221 


81,000 


.0000 


2,6500 



TABLE J. 1 5 



CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE ECONOMICS SAMPLE AT CHICAGO 
WHEN COMPARED WITH NORTHWESTERN 



OBSERVED 



VALUE 


FREQUENCY 


PROBABILITY 


0 


55 


.39353 


1 


16 


.18809 


2 


7 


.11859 


3 


1 1 


. 08079 


4 


0 


.05710 


5 + 


10 


.16190 




09 


1 .00000 



'HEORETICAL 




CHI 


FREQUENCY 


DIFFERENCE 


SQUARED 


35.024 


19.9758 


11 . 3931 


16.740 


-.7400 


.0327 


10.555 


-3,5545 


1,1971 


7.190 


-6.1903 


5.3294 


5.082 


-5.0819 


5.0819 


14.409 


-4.4091 


1,3492 


89,000 


. 0000 


24.3833 



TABLE A, 16 

CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 



TO SAMPLES OF BOOK USE 
FOR THE ECONOMICS SAMPLE AT NORTHNE S TF FJT 



VALUE 

0 

1 

2 

3 

4 

5+ 



OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CHI 

SQUARED 


55 


,47636 


42,396 


12,6040 


.3.7470 


20 


.2981 4 


26.534 


-6,5345 


1 .6092 


6 


.13726 


12.216 


-6,2161 


3.1631 


2 


.05562 


4,950 


-2.9502 


1.7582 


2 


,02101 


1.870 


.1301 


.0091 


4 


,01161 


1,033 


2,9667 


8,5178 


89 


1 ,00000 


89.000 


.0000 


18,804 4 



O 

ERIC 
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TABLE J, 17 



CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE ECONOMICS SAMPLE AT CHICAGO 
WHEN COMPARED WITH CALIFORNIA 



VALUE 


OBSERVED 

FREQUENCY 


PROBABILITY 


THEORETICAL 

FREQUENCY 


DIFFERENCE 


CHI 

SQUARED 


0 


69 


.63967 


60. 129 


8.8710 


1.3088 


1 


16 


.24683 


23.202 


-7.2020 


2.2355 


2 


5 


.07968 


7.490 


-2.4899 


.8277 


3 


2 


-02405 


2,261 


-.2607 


.0301 


4 


0 


.00700 


,658 


-.6580 


.6580 


5 + 


2 


.00277 


.260 


1,7396 


11.6225 




94 


1.00000 


94,000 


.0000 


16,6827 



TABLE J.18 

CHI SQUARED TABLE FOR FITTING 
THE NEGATIVE BINOMIAL DISTRIBUTION 
TO SAMPLES OF BOOK USE 
FOR THE ECONOMICS SAMPLE AT CALIFORNIA 



OBSERVED THEORETICAL CHI 



VALUE 


FREQUENCY 


PROBABILITY 


FREQUENCY 


DIFFERENCE 


SQUARED 


0 


57 


.41034 


38.572 


18.4280 


8,8041 


1 


17 


.31876 


29.963 


-12.9634 


5.6085 


2 


8 


.16277 


15,300 


-7,3004 


3.4833 


3 


3 


.06867 


6,455 


-3.4550 


1.8493 


4 


3 


.02593 


2.437 


.5626 


.1298 


5 + 


6 


,01353 


1.272 


4.7282 


17,5455 




94 


1.00000 


94.000 


. 0000 


37.5455 
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APPENDIX K 

TABLES USED TO CALCULATE CHI SQUARED 
FOR THE VARIANCE TEST FOR THE 
FU8SLER AND SIMON BOOK USE DATA 



The following tables shew the cell value, 
cell frequency, the square of the difference 
between the cell value and the mean, the frequency 
of the cell value multiplied by the difference, 
and the component of Chi Squared for the cell for 
the data in Chapter 8, 
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i 

0 

1 

2 

3 

4 

5 

i 

0 

1 

-5 

3 

4 

5 



TABLE K. 1 



VARIANCE TEST TABLE FOR BOOK USE OF 
THE PHILOSOPHY SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 





f* u (i - m) % 



(i - in) 



05 


1.291 


109,762 


21 


.019 


. 390 


12 


.746 


8.950 


3 


3.473 


10.419 


4 


8.200 


32,802 


7 


14,928 


104.494 



96 . 591 
« 344 

7.876 

9.169 

28,866 

91.955 



234 . POO 



TABLE K. 2 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHILOSOPHY SAMPLE AT YALE 



f- 

L 


(i - mf 


(i - m) 


j=. (i - ; 


90 


1.361 


122,496 


,,;4. 9 88 


18 


,028 


. 5 0 M 


, U 


7 


,694 


4.858 


4.164 


3 


3.361 


10,083 


8 . 6 4 2 


6 


8,028 


48.168 


4.129 


8 


14.694 


117.552 


100.759 








223.113 



206 






TABLE Km 3 



0 

1 

*> 



VARIANCE TEST TABLE FOR BOOK USE OF 
THE TEUTONIC LANGUAGES SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 



109 

19 

7 



l 

(i - m) 


f- (i - m) 


^(i - : 


1.041 


113.433 


111.194 


.000 


.008 


.008 


.960 


6,721 


6.588 


3.920 


19.599 


19.213 


8.800 


17.759 


17.409 


15.839 


110.876 


108.688 






263.099 



i 

0 

1 

o 

d,m 

3 

4 

5 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE AT YALE 



(i - m) 



f C (i - m) 



f. ? (i - n) /m 

II 



112 .565 

22 .062 

6 1.550 

3 5.055 

1 10.552 

5 18.048 



63.283 

1.357 

9.350 

15.165 

10.552 

90.241 



84.188 

1.805 

12.432 

20.175 

14.037 

120.053 
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TABLE K.5 



VARIANCE TEST TABLE FOR BOOK USE OF 
THE TEUTONIC LANGUAGES SAMPLE AT CHICAGO 
WHEN COMPARED WITH NORTHWESTERN 



i 


fl 


(i - m) 


f- (i - mf 


f. (i - 


0 


39 


4.000 


156.000 


78.000 


1 


5 


1.000 


5.000 


2.500 


2 


3 


0,000 


0.000 


0.000 


3 


1 


1.000 


1.000 


.500 


4 


4 


4,000 


16.000 


8.000 


5 


9 


9.000 


81 . 000 


40.500 



129 . son 



TABLE K. 6 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE AT NORTHWESTERN 



0 

1 

2 

3 

4 



46 

7 

2 

3 

1 

7 



(i - m) 



.953 

1.512 
4.971 
10.430 
17. 



n <i 

27.308 



m) 



3.023 

14.912 

10.430 

35.778 



(i - m) /m 



35.443 
.4 79 
3.924 
19.354 
13.536 
46.435 



119.170 
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cm m cr m *H o r- cm n a- in 



TABLE K.7 



0 

1 



VARIANCE TEST TABLE FOR BOOK USE OF 
THE TEUTONIC LANGUAGES SAMPLE AT CPICAGO 
UIIEN COMPARED WITH CALIFORNIA 



125 

23 



8 

1 

10 



(i - m) 


f- (i - m)* 


ft (i - 


1,012 


126.484 


125.740 


. 000 


.001 


.001 


,988 


1,976 


1 . 965 


3.976 


31.811 


31.624 


8.965 


8.965 


8,912 


15,953 


159.527 


158.588 



m) 2, / m 



326,829 



TABLE IC, 8 

VARIANCE TEST TABLE FOR BOOK USE 
OF THE TEUTONIC LANGUAGES SAMPLE AT CALIFORNIA 



125 

21 

7 

7 



(i - m) 



t c a 



m) 



(i - m) 



.601 


75.107 


96.894 


.051 


1.062 


1 , 370 


1.500 


10.502 


13,548 


4.950 


34.650 


44.701 


10.400 


31.199 


40.249 


17.849 


107.096 


138.162 



334,923 
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TABLE K. 9 



0 

1 

2 

3 

4 

5 



VAPIANCE TEST TABI^E FOR BOOK USE OF 
THE BIOLOGY SAMPLE AT CHICAGO 
WHEN COMPARED WITH YALE 



81 

10 

4 

7 

2 

11 



(i - m) 

3.965 

.983 

.000 

1.017 

4.035 

9.052 



f ; (i 

321 



in) 



197 

827 

000 

122 

070 



f t (i 

161 .795 
4.935 
,000 
3,577 
4,053 
50.005 



z 

m) /m 



223.865 



TABLE K. 10 







VARIANCE TEST TABLE FOR BOOK 


USE 






OF THE BIOLOGY 


SAMPLE AT YAJ 


:.e 






% 


Z 




i 


ft 


(i - m) 


f^(i - m) 




0 


89 


3.366 


299.611 


163.295 


1 


7 


.697 


4,978 


2.659 


2 


3 


.027 


,082 


,04 5 


3 


3 


1.358 


4.073 


2.220 


4 


2 


4,688 


9.376 


5.110 


5 


11 


10,019 


110.205 


60.064 
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233.393 
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ERIC 



O 1 



i 

0 

1 

2 

3 

4 

5 

i 

0 

1 

2 

3 

4 

5 

O 

ic 



n h h to c 'in - j cTi 'Co c in vo cc 



TABLE K. 1 1 



VARIANCE TEST TABLE FOR BOOK TJSE OF 
THE BIOLOGY SAMPLE AT CHICAGO 
WHEN COMPARED WITH CALIFORNIA 



f* (i - m f 



f L Ci 




f • 

” L 



(i 



m) Z /m 



. 963 

, 000 
1 ,038 
4,076 
9 . 1*4 
16.131 



69.308 
.006 
7.267 
20. 379 
0.000 
80.757 



70.641 
. 006 
7,4 06 
20.771 
0 . 000 
82.310 



181.135 



TABLE K. 12 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE BIOLOGY SAMPLE AT CAL I FORM I A. 



(i 




f: (1 




(i 



z , 

m) /m 



2.482 
. 331 
.180 
2.029 
5.878 
11 .727 



146.44 4 
5.961 
1 . 802 
10.146 
35.270 
93.819 



92.953 
3.784 
1.144 
6.4 40 
22.387 
59,550 



1 86,258 



21 1 




TABLE K. 13 



VARIANCE TEST TABI.E FOR BOOK USE OF 
THE PHYSICS SAMPLE AT CHICAGO 
WHET COMPARED WITH TTORTHT7E STE RN 



A 




(i - m)* 


f’ L (i - m) 2. 


±i (A - 


0 


28 


23.781 


665.858 


136.543 


1 


14 


15.028 


210.386 


43.142 


2 


9 


8.274 


74.870 


15.271 


3 


4 


3.521 


14.086 


2.888 


4 


6 


.768 


4.610 


.94 5 


5 


20 


.015 


.305 


, .063 



198.853 



TABLE IC.14 



i 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE PHYSICS SAMPLE AT NORTHWESTERN 

f; (A — m) f ; (i — m) f # * Ci — m) /m 

W I* w 



0 44 

1 13 

2 4 

3 3 

4 3 

5 14 



3.294 
.66 if 
.03 4 
1 .405 
4,775 
10,145 



144.916 

8.631 

.137 

4.214 

14.325 

142.036 



79.853 
4.756 
. 076 
2 . 322 
7.893 
78.265 



173.1 64 
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w ro =t in «h c t- n n cr in 



TABLE X ' .'15 



i 



VARIANCE TEST TABLE FOR BOON USE OF 
THE ECONOMICS SAMPLE AT CHICAGO 
WHEN COMPARED WITH NORTHWESTERN 



f i “ m > Z 



f : (i 




(i - m) 



0 

1 



55 

15 



4. 850 
1 .445 



266.705 

23.126 



121.120 

10.501 



7 .041 

1 6 636 

0 3.232 

10 7.827 



. 286 



. 636 

0.000 

78.274 



.130 

.289 
0.000 
35 .503 



167.587 



TABLE K, 1 6 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE ECONOMICS SAMPLE AT NORTHWESTERN 



(i - m)* 




f £ (± - m) 



55 .788 43.335 

20 .013 .252 



48.820 

.284 



6 1.237 
2 4.462 
2 9.687 
4 16.912 



7.424 

8.924 

19.374 

67.646 



8.364 
10. 054 
21 .826 
76.209 



165.557 
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UJ 4= W M -* O H- UlJSUMao 



TABLE K. 17 



i 



VARIANCE TEST TABT.E FOR BOOK USE OF 
THE ECONOMICS SAMPLE AT CHICAGO 
WIDEN COMPARED WITH CALIFORNIA 



(i 



m) 



1 C U 



m) 



f / (i - m ) Zm 



69 .272 

16 .229 

5 2.1 87 

2 6.144 

0 12.101 

2 20.059 



18,750 

3,667 

10.933 

12.288 

0,000 

40.118 



35,968 
7.034 
20c 973 
23.573 
0.000 
76.960 



164.509 



TABLE K. 1 8 



VARIANCE TEST TABLE FOR BOOK USE 
OF THE ECONOMICS SAMPLE AT CALIFORNIA 

t Z 2. . 

£; (i - m) £; (i - m) f' (i - m) /m 

L ft 



57 1.065 

17 .001 

8 .937 

3 3.873 

3 8.810 

6 15.746 



60.696 
.017 
7.490 
11 .62 0 
26.429 
94.474 



58,819 

.017 

7.266 

11.261 

25.611 

91.553 



194.527 
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